Systems and methods for vehicle guidance using depth map generation

ABSTRACT

This disclosure relates to systems and methods for vehicle guidance. Stereo images may be obtained at different times using a stereo image sensor. A depth image may be determined based on an earlier obtained pair of stereo images. The depth image may be refined based on predictions of an earlier stereo image and a later obtained stereo image. Depth information for an environment around a vehicle may be obtained. The depth information may characterize distances between the vehicle and the environment around the vehicle. A spherical depth map may be generated from the depth information. Maneuver controls for the vehicle may be provided based on the spherical depth map.

CROSS-REFERENCE TO RELATED APPLICATION(S)

This application is a continuation of U.S. patent application Ser. No. 16/360,097, filed on Mar. 21, 2019, which is a continuation of U.S. patent application Ser. No. 15/204,375, filed Jul. 7, 2016, now U.S. Pat. No. 10,269,257, which claims the benefit of U.S. Provisional Application No. 62/203,745, filed Aug. 11, 2015, U.S. Provisional Application No. 62/203,765, filed Aug. 11, 2015, and U.S. Provisional Application No. 62/203,754, filed Aug. 11, 2015, the entire disclosures of which are hereby incorporated by reference.

TECHNICAL FIELD

This disclosure relates to systems and methods for vehicle guidance.

BACKGROUND

A moving platform carrying a sensor to provide control and guidance for the moving platform is known. In such a system, the type and/or the extent of control and guidance provided may be limited by resource costs. For example, with respect to unmanned aerial vehicles, the processing and/or storage resource costs of detecting objects may limit the objects identified to those objects that are static, i.e., not moving. The amount of processing and/or storage resources available to process depth information for the environment of an unmanned aerial vehicle may be limited.

SUMMARY

In some aspects of the disclosure, a system configured to detect a moving object may include one or more of an image sensor, a motion and orientation sensor, one or more hardware-implemented processors, and/or other components. Some or all of the system may be installed in a vehicle and/or be otherwise coupled with the vehicle. In some implementations, the one or more hardware-implemented processors may be located remotely from the vehicle. In some implementations, the one or more hardware-implemented processors may be located on or in the vehicle. In some implementations, the image sensor may be located remotely from the vehicle. In some implementations, the image sensor may be located on or in the vehicle. In some implementations, the image sensor, the motion and orientation sensor, and the one or more hardware-implemented processors may be carried on or in the vehicle, and the field of view of the image sensor may be a function of the position and orientation of the vehicle.

The image sensor may be configured to generate visual output signals conveying visual information within a field of view of the image sensor. The image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

The motion and orientation sensor may be configured to generate motion and orientation output signals conveying motion information and orientation information of the image sensor. The motion and orientation sensor may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

The hardware-implemented processor(s) may include one or more computing/processing devices with one or more algorithms/logics implemented in hardware to perform one or more functions. Algorithms/logics may be implemented in hardware via machine-readable instructions (e.g., a hardware description language file, a netlist, etc.). As a non-limiting example, the hardware-implemented processor(s) may include one or more of a field-programmable gate array, an application-specific integrated circuit, and/or other hardware-implemented processors.

The hardware-implemented processor(s) may be configured by machine-readable instructions to execute one or more function components. The function components may include one or more of an imaging component, a predicted change component, a predicted imaging component, an actual change component, an optical flow component, an optical flow adjustment component, a detection component, a behavior component, and/or other function components.

The imaging component may be configured to obtain a first image. The first image may be determined based on the visual output signals such that the visual output signals used to determine the first image are generated by the image sensor at a first time. The imaging component may be configured to obtain a second image. The second image may be determined based on the visual output signals such that the visual output signals used to determine the second image are generated by the image sensor at a second time that is subsequent to the first time.

The predicted change component may be configured to obtain a predicted change in the field of view of the image sensor between the first time and the second time. The predicted change in the field of view of the image sensor between the first time and the second time may be determined based on the motion and orientation output signals.

The predicted imaging component may be configured to determine a predicted first image by adjusting the second image based on the predicted change in the field of view.

The actual change component may be configured to obtain an actual change in the field of view of the image sensor between the first time and the second time. The actual change in the field of view of the image sensor between the first time and the second time may be determined based on a comparison of the first image with the predicted first image. In some implementations, the determination of the actual change in the field of view may include, when comparing the first image with the predicted first image, disregarding portions of the first image and the predicted first image.

The optical flow component may be configured to obtain optical flow between the first image and the second image. In some implementations, obtaining the optical flow between the first image and the second image may include obtaining individual optical flow vectors between the first image and the second image.

The optical flow adjustment component may be configured to obtain adjusted optical flow. The adjusted optical flow may be determined based on the actual change in the field of view. The adjusted optical flow may be determined by adjusting the optical flow to account for the actual change in the field of view. In some implementations, adjusting the optical flow to account for the actual change in the field of view may include adjusting the individual optical flow vectors to offset the actual change in the field of view.

The detection component may be configured to obtain the presence of the moving object. The presence of the moving object may be detected based on the adjusted optical flow. In some implementations, the detection of the presence of the moving object may be made through identification of a cluster of the adjusted optical flow vectors. In some implementations, the detection component may be configured to obtain an object type of the moving object. The object type of the moving object may be determined based on the adjusted optical flow.
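By way of non-limiting illustration, the cluster-based detection described above may be sketched in code. The following Python sketch assumes the adjusted optical flow is available as an (H, W, 2) array with ego-motion already removed; the function name, magnitude threshold, and minimum cluster size are hypothetical choices rather than values from this disclosure.

```python
# Minimal sketch: detect moving objects as clusters of large residual flow.
import numpy as np
from scipy import ndimage


def detect_moving_objects(adjusted_flow, mag_threshold=1.0, min_pixels=50):
    # Residual flow magnitude; large values suggest independent motion.
    magnitude = np.linalg.norm(adjusted_flow, axis=2)
    moving_mask = magnitude > mag_threshold

    # Group adjacent moving pixels into clusters (candidate objects).
    labels, num_clusters = ndimage.label(moving_mask)
    detections = []
    for i in range(1, num_clusters + 1):
        ys, xs = np.nonzero(labels == i)
        if len(ys) >= min_pixels:  # discard small, noisy clusters
            detections.append({
                "centroid": (xs.mean(), ys.mean()),
                "num_pixels": len(ys),
                "mean_flow": adjusted_flow[ys, xs].mean(axis=0),
            })
    return detections
```

The mean flow of each cluster could, for example, feed a downstream step that determines the object type from the motion pattern.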

In some implementations, the behavior component may be configured to control the vehicle to avoid the moving object. In some implementations, the behavior component may be configured to control the vehicle to follow the moving object. In some implementations, the behavior component may be configured to effectuate one or more operating behaviors of a vehicle based on the identified object type.

In some implementations, the one or more operating behaviors of the vehicle may include the vehicle maintaining a minimum distance from the moving object and/or a maximum distance from the moving object. In some implementations, the one or more operating behaviors of the vehicle may include the vehicle maintaining a minimum speed and/or a maximum speed. In some implementations, the one or more operating behaviors of the vehicle may include the vehicle maintaining the moving object within the field of view of the image sensor. In some implementations, the one or more operating behaviors of the vehicle may include the vehicle facilitating wireless communication of the visual information to a remote computing device.

In some aspects of the disclosure, a system for collision avoidance of an unmanned aerial vehicle may include one or more processors and/or other components. In some implementations, the system may include one or more stereo image sensors.

The stereo image sensor(s) may be carried by the unmanned aerial vehicle. The stereo image sensor(s) may include a first image sensor, a second image sensor, and/or other components. The first image sensor may be configured to generate first visual output signals conveying visual information within a field of view of the first image sensor. The first image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. The second image sensor may be configured to generate second visual output signals conveying visual information within a field of view of the second image sensor. The second image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

The processor(s) may be configured by machine-readable instructions to execute one or more computer program components. The computer program components may include one or more of a depth information component, a spherical depth map component, a maneuver controls component, and/or other computer program components. In some implementations, the processor(s) may include one or more of a hardware-implemented processor, a software-implemented processor, and/or other processors. In some implementations, the hardware-implemented processor(s) may be located remotely from the software-implemented processor(s).

The depth information component may be configured to obtain depth information for an environment around the unmanned aerial vehicle and/or other information. The depth information may characterize one or more distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle. The depth information may include one or more of distance information, disparity information, and/or other depth information. In some implementations, the depth information component may be configured to compare the first visual information with the second visual information to determine the depth information and/or other information.

The spherical depth map component may be configured to generate a spherical depth map from the depth information and/or other information. A center of the spherical depth map may coincide with a location of the unmanned aerial vehicle. The spherical depth map may represent distances to closest surfaces of the environment around the unmanned aerial vehicle as a function of longitude angles and latitude angles.

In some implementations, the spherical depth map may be divided into map cells corresponding to the longitude angles and the latitude angles. The map cells may include a first map cell corresponding to a first longitude angle and a first latitude angle. In some implementations, generating the spherical depth map may include reducing a three-dimensional map of the surroundings into the spherical depth map.

In some implementations, generating the spherical depth map may include determining distance values for the map cells. The distance values may be determined based on the distances to the closest surfaces of the environment around the unmanned aerial vehicle and/or other information. A first distance value for the first map cell may be determined based on a first distance to a first closest surface of the environment around the unmanned aerial vehicle at the first longitude angle and the first latitude angle and/or other information.

In some implementations, the determination of distance values may disregard distances to surfaces of the environment around the unmanned aerial vehicle at individual longitude angles and individual latitude angles that are greater than the distances to the closest surfaces of the environment around the unmanned aerial vehicle at the individual longitude angles and the individual latitude angles. In some implementations, the distance values may correspond only to the closest surfaces of the environment around the unmanned aerial vehicle at individual longitude angles and individual latitude angles.

In some implementations, the spherical depth map may include one or more dead zones in one or more polar regions. The distance values may not be determined for the map cells in the dead zone(s).
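By way of non-limiting illustration, the reduction of vehicle-centered three-dimensional points to such a spherical depth map may be sketched as follows in Python. The cell resolution and the extent of the polar dead zones are hypothetical choices; the disclosure does not prescribe particular values.

```python
# Minimal sketch: bin 3-D points (N, 3, in meters, vehicle-centered) into
# a latitude/longitude grid, keeping the closest surface per cell.
import numpy as np


def build_spherical_depth_map(points, lon_cells=72, lat_cells=36,
                              dead_zone_deg=10.0):
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    dist = np.sqrt(x ** 2 + y ** 2 + z ** 2)
    lon = np.degrees(np.arctan2(y, x))  # [-180, 180)
    lat = np.degrees(np.arcsin(np.clip(z / np.maximum(dist, 1e-9), -1.0, 1.0)))

    lon_idx = np.clip(((lon + 180.0) / 360.0 * lon_cells).astype(int),
                      0, lon_cells - 1)
    lat_idx = np.clip(((lat + 90.0) / 180.0 * lat_cells).astype(int),
                      0, lat_cells - 1)

    # Each cell keeps only the distance to the closest surface in that
    # direction; farther surfaces at the same angles are disregarded.
    depth_map = np.full((lat_cells, lon_cells), np.inf)
    np.minimum.at(depth_map, (lat_idx, lon_idx), dist)

    # Polar dead zones: distance values are not determined near the poles.
    dead_rows = int(dead_zone_deg / 180.0 * lat_cells)
    if dead_rows:
        depth_map[:dead_rows] = np.nan
        depth_map[-dead_rows:] = np.nan
    return depth_map
```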

In some implementations, the spherical depth map component may be configured to, responsive to detecting the unmanned aerial vehicle traveling a threshold distance, transform the spherical depth map. The spherical depth map may be transformed such that the center of the spherical depth map coincides with the location of the unmanned aerial vehicle.

The maneuver controls component may be configured to provide maneuver controls for the unmanned aerial vehicle. The maneuver controls for the unmanned aerial vehicle may be based on the spherical depth map. In some implementations, the maneuver controls for the unmanned aerial vehicle may include controlling the unmanned aerial vehicle to avoid one or more objects in the environment around the unmanned aerial vehicle.

In some aspects of the disclosure, a system for collision avoidance of an unmanned aerial vehicle may include one or more processors and/or other components. In some implementations, the system may include one or more stereo image sensors. In some implementations, the system may include one or more motion and orientation sensors.

The stereo image sensor(s) may be carried by the unmanned aerial vehicle. The stereo image sensor(s) may include a first image sensor, a second image sensor, and/or other components. The first image sensor may be configured to generate first visual output signals conveying visual information within a field of view of the first image sensor. The first image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. The second image sensor may be configured to generate second visual output signals conveying visual information within a field of view of the second image sensor. The second image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

The motion and orientation sensor(s) may be configured to generate motion and orientation output signals conveying motion information and orientation information of the unmanned aerial vehicle. The motion and orientation sensor may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

The processor(s) may be configured by machine-readable instructions to execute one or more computer program components. The computer program components may include one or more of an object detection component, a depth information component, a depth map component, a physical model information component, a predicted path component, a collision detection component, a collision avoidance component, and/or other computer program components.

The object detection component may be configured to detect one or more stationary objects in an environment around the unmanned aerial vehicle. The object detection component may be configured to detect one or more moving objects in the environment around the unmanned aerial vehicle. In some implementations, the object detection component may be configured to identify one or more object types of one or more detected moving objects.

The depth information component may be configured to obtain depth information for an environment around the unmanned aerial vehicle and/or other information. The depth information may characterize one or more distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle. The environment around the unmanned aerial vehicle may include one or more detected stationary objects and/or one or more detected moving objects. The depth information may include one or more of distance information, disparity information, and/or other depth information. In some implementations, the depth information component may be configured to compare the first visual information with the second visual information to determine the depth information and/or other information.

The depth map component may be configured to generate a depth map from the depth information and/or other information. The depth map may characterize distances between the unmanned aerial vehicle and one or more detected stationary objects and/or one or more detected moving objects. In some implementations, the depth map may include a spherical depth map and/or other depth maps.

The accuracy of the depth map may be characterized by a depth map accuracy and/or other information. The depth map accuracy may provide information about the accuracy of the distances between the unmanned aerial vehicle and the detected objects characterized by the depth map with respect to the actual distances between the unmanned aerial vehicle and the detected objects. That is, the depth map accuracy may provide information about the accuracy of the distances between the unmanned aerial vehicle and one or more detected stationary objects and/or one or more detected moving objects characterized by the depth map with respect to the actual distances between the unmanned aerial vehicle and the one or more detected stationary objects and/or the one or more detected moving objects.

The physical model information component may be configured to obtain vehicle physical model information and/or other information. The vehicle physical model information may characterize a motion of the unmanned aerial vehicle. In some implementations, the vehicle physical model information may be based on the motion and orientation information and/or other information. In some implementations, the vehicle physical model information may be based on one or more of a weight of the unmanned aerial vehicle, a size of the unmanned aerial vehicle, a shape of the unmanned aerial vehicle, a linear speed of the unmanned aerial vehicle, a linear acceleration of the unmanned aerial vehicle, a linear direction of the unmanned aerial vehicle, an angular speed of the unmanned aerial vehicle, an angular acceleration of the unmanned aerial vehicle, an angular direction of the unmanned aerial vehicle, a motion instruction for the unmanned aerial vehicle, an environmental condition around the unmanned aerial vehicle, and/or other parameters.

The accuracy of the vehicle physical model information may be characterized by a vehicle physical model accuracy and/or other information. The vehicle physical model accuracy may provide information about the accuracy of the motion of the unmanned aerial vehicle characterized by the vehicle physical model information with respect to an actual motion of the unmanned aerial vehicle.

The physical model information component may be configured to, responsive to detection of one or more moving objects, obtain moving object physical model information and/or other information. The moving object physical model information may characterize one or more motions of one or more moving objects. In some implementations, the moving object physical model information may be based on one or more object types of one or more detected moving objects and/or other information.

The accuracy of the moving object physical model information may be characterized by a moving object physical model accuracy and/or other information. The moving object physical model accuracy may provide information about the accuracy of the motion of one or more moving objects characterized by the moving object physical model information with respect to the actual motion of one or more moving objects.

The predicted path component may be configured to determine a predicted vehicle path based on the vehicle physical model information, the vehicle physical model accuracy, and/or other information. The predicted vehicle path may include predicted location(s) of the unmanned aerial vehicle at one or more future times.

The predicted path component may be configured to, responsive to detection of one or more moving objects, determine one or more predicted moving object paths based on the depth map, the depth map accuracy, the moving object physical model information, the moving object physical model accuracy, and/or other information. The predicted moving object path(s) may include predicted location(s) of the moving object(s) at one or more future times.

The collision detection component may be configured to, responsive to detecting one or more stationary objects, determine whether the unmanned aerial vehicle moving in the predicted vehicle path may collide with one or more detected stationary objects based on the depth map, the depth map accuracy, the predicted vehicle path, and/or other information. One or more collisions may be determined at one or more locations and/or at one or more future times.

The collision detection component may be configured to, responsive to detection of one or more moving objects, determine whether the unmanned aerial vehicle moving in the predicted vehicle path may collide with one or more detected moving objects based on the predicted vehicle path, the predicted moving object path, and/or other information. One or more collisions may be determined at one or more locations and/or at one or more future times.

The collision avoidance component may be configured to, responsive to determination of one or more collisions of the unmanned aerial vehicle moving in the predicted vehicle path, change a velocity of the unmanned aerial vehicle to avoid one or more collisions. In some implementations, changing the velocity of the unmanned aerial vehicle may include changing one or more speeds of the unmanned aerial vehicle. In some implementations, changing the velocity of the unmanned aerial vehicle may include changing one or more directions of the unmanned aerial vehicle to move the unmanned aerial vehicle in a path that deviates from the predicted vehicle path.
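By way of non-limiting illustration, a check of a predicted vehicle path against a spherical depth map might be sketched as follows. The path representation (a list of future (x, y, z) offsets from the vehicle, in meters) and the safety margin are hypothetical simplifications; the disclosure's collision detection also weighs the depth map accuracy and the physical model accuracies, which this sketch omits.

```python
# Minimal sketch: flag the first future time at which the predicted path
# reaches the closest surface in its direction, less a safety margin.
import numpy as np


def first_predicted_collision(predicted_path, depth_map, margin=1.0):
    lat_cells, lon_cells = depth_map.shape
    for t, (x, y, z) in enumerate(predicted_path):
        dist = np.sqrt(x ** 2 + y ** 2 + z ** 2)
        lon = np.degrees(np.arctan2(y, x))
        lat = np.degrees(np.arcsin(np.clip(z / max(dist, 1e-9), -1.0, 1.0)))
        lo = min(int((lon + 180.0) / 360.0 * lon_cells), lon_cells - 1)
        li = min(int((lat + 90.0) / 180.0 * lat_cells), lat_cells - 1)
        surface = depth_map[li, lo]
        # Cells with no return (inf) or in a dead zone (NaN) are treated
        # as free space in this simplified check.
        if np.isfinite(surface) and dist + margin >= surface:
            return t  # index of the first future time with a collision
    return None
```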

In some aspects of the disclosure, a system for estimating an ego-motion may include one or more of a hardware-implemented processor, a stereo image sensor, a motion and orientation sensor, and/or other components.

The stereo image sensor(s) may include a first image sensor, a second image sensor, and/or other components. The first image sensor may be configured to generate first visual output signals conveying visual information within a field of view of the first image sensor. The first image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. The second image sensor may be configured to generate second visual output signals conveying visual information within a field of view of the second image sensor. The second image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

The motion and orientation sensor(s) may be configured to generate motion and orientation output signals conveying motion information and orientation information of the stereo image sensor. The motion and orientation sensor may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

The hardware-implemented processor(s) may include one or more computing/processing devices with one or more algorithms/logics implemented in hardware to perform one or more functions. Algorithms/logics may be implemented in hardware via machine-readable instructions (e.g., a hardware description language file, a netlist, etc.). As a non-limiting example, the hardware-implemented processor(s) may include one or more of a field-programmable gate array, an application-specific integrated circuit, and/or other hardware-implemented processors.

The hardware-implemented processor(s) may be configured by machine-readable instructions to execute one or more function components. The function components may include one or more of an imaging component, a depth image component, a predicted motion component, a predicted imaging component, an error component, an actual motion component, and/or other function components.

The imaging component may be configured to obtain one or more images and/or other information. The imaging component may be configured to obtain a first image, a second image, a third image, a fourth image, and/or other images. The first image may be determined based on the first visual output signals such that the first visual output signals used to determine the first image are generated by the first image sensor at a first time. The second image may be determined based on the second visual output signals such that the second visual output signals used to determine the second image are generated by the second image sensor at the first time.

The third image may be determined based on the first visual output signals such that the first visual output signals used to determine the third image are generated by the first image sensor at a second time that is subsequent to the first time. The fourth image may be determined based on the second visual output signals such that the second visual output signals used to determine the fourth image are generated by the second image sensor at the second time.

The imaging component may be configured to undistort and rectify one or more images. The imaging component may be configured to undistort and rectify the first image, the second image, and/or other images. The imaging component may be configured to undistort and rectify the third image, the fourth image, and/or other images.

The depth image component may be configured to determine one or more depth images based on one or more comparisons of images. The depth image component may be configured to determine a depth image based on a comparison of the first image and the second image. The depth image may include pixel values characterizing distances between the stereo image sensor and objects within the first image and the second image.

In some implementations, the depth image may be determined using one or more parallel algorithms and/or other algorithms. In some implementations, the depth image may be determined using one or more block matching algorithms and/or other algorithms. In some implementations, the depth image may be determined using one or more semi-global block matching algorithms and/or other algorithms.
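By way of non-limiting illustration, a depth image may be computed from an undistorted, rectified stereo pair with a semi-global block matching implementation such as OpenCV's. In the following Python sketch, the matcher parameters and the focal length and baseline values are hypothetical.

```python
# Minimal sketch: disparity via semi-global block matching, then depth
# from the standard stereo relation Z = f * B / d.
import cv2
import numpy as np


def depth_from_stereo(left_gray, right_gray, focal_px=700.0, baseline_m=0.1):
    matcher = cv2.StereoSGBM_create(
        minDisparity=0,
        numDisparities=64,  # must be divisible by 16
        blockSize=5,
    )
    # OpenCV returns fixed-point disparity scaled by 16.
    disparity = matcher.compute(left_gray, right_gray).astype(np.float32) / 16.0

    depth = np.full_like(disparity, np.inf)
    valid = disparity > 0  # non-positive disparity marks invalid matches
    depth[valid] = focal_px * baseline_m / disparity[valid]
    return depth
```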

The predicted motion component may be configured to obtain one or more predicted motions of the stereo image sensor between different times. The predicted motion component may be configured to obtain a predicted motion of the stereo image sensor between the first time and the second time. The predicted motion of the stereo image sensor may be determined based on the motion and orientation output signals. The motion and orientation output signals may convey motion information and orientation information of the stereo image sensor. In some implementations, the predicted motion may be determined based on visual odometry.

The predicted imaging component may be configured to determine one or more predicted images based on one or more predicted motions of the stereo image sensor, one or more depth images, and/or other information. The predicted imaging component may be configured to determine a predicted third image by adjusting the first image based on the predicted motion, the depth image, and/or other information. The predicted imaging component may be configured to determine a predicted fourth image by adjusting the second image based on the predicted motion, the depth image, and/or other information.

The error component may be configured to determine one or more error distributions of one or more predicted motions by comparing one or more images with one or more predicted images. The error component may be configured to determine an error distribution of the predicted motion by comparing the third image with the predicted third image and the fourth image with the predicted fourth image.

The actual motion component may be configured to obtain one or more estimated actual motions of the stereo image sensor between different times. The actual motion component may be configured to obtain an estimated actual motion of the stereo image sensor between the first time and the second time. The estimated actual motion may be determined by adjusting the predicted motion to reduce the error distribution of the predicted motion. In some implementations, the predicted motion may be adjusted based on one or more absolute error thresholds and/or other information. In some implementations, the predicted motion may be adjusted based on one or more relative error thresholds and/or other information.

In some implementations, the estimated actual motion of the stereo image sensor between the first time and the second time may include an estimated position of the stereo image sensor at the second time, an estimated orientation of the stereo image sensor at the second time, an estimated velocity of the stereo image sensor at the second time, and/or other estimated actual motion of the stereo image sensor.

In some implementations, the estimated actual motion of the stereo image sensor between the first time and the second time may be used to estimate one or more of an estimated gravity-aligned position of the stereo image sensor at the second time, an estimated gravity-aligned orientation of the stereo image sensor at the second time, an estimated gravity-aligned velocity of the stereo image sensor at the second time, and/or other estimated gravity-aligned motion of the stereo image sensor.

In some aspects of the disclosure, a system for refining a depth image may include one or more of a processor, a stereo image sensor, and/or other components. In some implementations, the system may include one or more motion and orientation sensors.

The stereo image sensor(s) may include a first image sensor, a second image sensor, and/or other components. The first image sensor may be configured to generate first visual output signals conveying visual information within a field of view of the first image sensor. The first image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. The second image sensor may be configured to generate second visual output signals conveying visual information within a field of view of the second image sensor. The second image sensor may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

The motion and orientation sensor(s) may be configured to generate motion and orientation output signals conveying motion information and orientation information of the stereo image sensor. The motion and orientation sensor may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

The processor(s) may be configured by machine-readable instructions to execute one or more computer program components. The computer program components may include one or more of an imaging component, a depth image component, a predicted motion component, a predicted imaging component, a refine component, and/or other computer program components.

The imaging component may be configured to obtain one or more images and/or other information. The imaging component may be configured to obtain a first image, a second image, a third image, a fourth image, and/or other images. The first image may be determined based on the first visual output signals such that the first visual output signals used to determine the first image are generated by the first image sensor at a first time. The second image may be determined based on the second visual output signals such that the second visual output signals used to determine the second image are generated by the second image sensor at the first time.

The third image may be determined based on the first visual output signals such that the first visual output signals used to determine the third image are generated by the first image sensor at a second time that is subsequent to the first time. The fourth image may be determined based on the second visual output signals such that the second visual output signals used to determine the fourth image are generated by the second image sensor at the second time.

The imaging component may be configured to undistort and rectify one or more images. The imaging component may be configured to undistort and rectify the first image, the second image, and/or other images. The imaging component may be configured to undistort and rectify the third image, the fourth image, and/or other images.

The depth image component may be configured to determine one or more depth images based on one or more comparisons of images. The depth image component may be configured to determine a first depth image based on a comparison of the first image and the second image. The first depth image may include pixel values characterizing distances between the stereo image sensor and objects within the first image and the second image. In some implementations, the depth image component may be configured to determine a second depth image based on a comparison of the third image and the fourth image. The second depth image may include pixel values characterizing distances between the stereo image sensor and objects within the third image and the fourth image.

The predicted motion component may be configured to obtain one or more predicted motions of the stereo image sensor between different times. The predicted motion component may be configured to obtain a predicted motion of the stereo image sensor between the first time and the second time. In some implementations, the predicted motion of the stereo image sensor may be determined based on the motion and orientation output signals. The motion and orientation output signals may convey motion information and orientation information of the stereo image sensor. In some implementations, the predicted motion may be determined based on visual odometry.

The predicted imaging component may be configured to determine one or more predicted images based on one or more predicted motions of the stereo image sensor, one or more depth images, and/or other information. The predicted imaging component may be configured to determine a predicted second image by adjusting the first image based on the first depth image and/or other information. The predicted imaging component may be configured to determine a predicted third image by adjusting the first image based on the predicted motion, the first depth image, and/or other information. In some implementations, the predicted imaging component may be configured to determine a predicted second depth image by adjusting the first depth image based on the predicted motion and/or other information.

The refine component may be configured to refine one or more depth images based on one or more comparisons of images with predicted images. The refine component may be configured to refine the first depth image based on a comparison of the second image with the predicted second image. In some implementations, refining the first depth image based on the comparison of the second image with the predicted second image may include determining a mismatch between an image patch in the second image and a corresponding image patch in the predicted second image. Responsive to the mismatch meeting and/or exceeding an error threshold, one or more pixel values of the first depth image corresponding to the image patch may be rejected.

The refine component may be configured to refine the first depth image based on a comparison of the third image with the predicted third image. In some implementations, refining the first depth image based on the comparison of the third image with the predicted third image may include determining a mismatch between an image patch in the third image and a corresponding image patch in the predicted third image. Responsive to the mismatch meeting and/or exceeding an error threshold, one or more pixel values of the first depth image corresponding to the image patch may be rejected.

In some implementations, the refine component may be configured to refine the second depth image based on a comparison of the second depth image with the predicted second depth image. In some implementations, refining the second depth image based on the comparison of the second depth image with the predicted second depth image may include determining a mismatch between a pixel value in the second depth image and a corresponding pixel value in the predicted second depth image. Responsive to the mismatch meeting and/or exceeding an error threshold, a pixel value of the second depth image corresponding to the mismatch may be rejected.

In some implementations, the error threshold may be based on an inverse of the distances between the stereo image sensor and the objects. In some implementations, the mismatch between the image patch in the second image and the corresponding image patch in the predicted second image may be determined based on an error distribution.
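By way of non-limiting illustration, the patch-based rejection described above might be sketched as follows, assuming grayscale images. The patch size and base threshold are hypothetical; the threshold scales with the inverse of the distance, as described above, so that nearby surfaces tolerate a larger photometric mismatch.

```python
# Minimal sketch: reject depth pixels whose image patch disagrees between
# the second image and the predicted second image.
import numpy as np


def refine_depth(depth, image, predicted_image, patch=5, base_threshold=100.0):
    refined = depth.copy()
    half = patch // 2
    h, w = image.shape
    for y in range(half, h - half, patch):
        for x in range(half, w - half, patch):
            a = image[y - half:y + half + 1,
                      x - half:x + half + 1].astype(np.float32)
            b = predicted_image[y - half:y + half + 1,
                                x - half:x + half + 1].astype(np.float32)
            mismatch = np.mean((a - b) ** 2)
            # Error threshold based on the inverse of the distance.
            threshold = base_threshold / max(depth[y, x], 1e-3)
            if mismatch >= threshold:
                # Reject the depth values corresponding to the image patch.
                refined[y - half:y + half + 1, x - half:x + half + 1] = np.nan
    return refined
```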

These and other objects, features, and characteristics of the system and/or method disclosed herein, as well as the methods of operation and functions of the related elements of structure and the combination of parts and economies of manufacture, will become more apparent upon consideration of the following description and the appended claims with reference to the accompanying drawings, all of which form a part of this specification, wherein like reference numerals designate corresponding parts in the various figures. It is to be expressly understood, however, that the drawings are for the purpose of illustration and description only and are not intended as a definition of the limits of the invention. As used in the specification and in the claims, the singular form of “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a vehicle configured to detect a moving object using adjusted optical flow.

FIG. 2A illustrates a top-down view of an arrangement of an image sensor, a sphere, and a block at time t.

FIG. 2B illustrates an exemplary image at time t.

FIG. 3A illustrates a top-down view of an arrangement of an image sensor, a sphere, and a block at time t+1.

FIG. 3B illustrates an exemplary image at time t+1.

FIG. 4 illustrates an exemplary predicted image at time t.

FIG. 5 illustrates an exemplary determined optical flow.

FIG. 6 illustrates an exemplary adjusted optical flow.

FIG. 7 illustrates a method to detect a moving object using adjusted optical flow.

FIG. 8 illustrates a system for collision avoidance of an unmanned aerial vehicle using a spherical depth map.

FIGS. 9A-9D illustrate exemplary environments around an unmanned aerial vehicle.

FIG. 10 illustrates an exemplary spherical depth map.

FIG. 11 illustrates a method for collision avoidance of an unmanned aerial vehicle using a spherical depth map.

FIG. 12 illustrates a system for collision avoidance of an unmanned aerial vehicle using a predicted vehicle path.

FIGS. 13A-13E illustrate exemplary environments around an unmanned aerial vehicle.

FIG. 14 illustrates a method for collision avoidance of an unmanned aerial vehicle using a predicted vehicle path.

FIG. 15 illustrates a system for estimating an ego-motion.

FIG. 16A illustrates a top-down view of an arrangement of a stereo image sensor, a sphere, and a block at time t.

FIG. 16B illustrates a top-down view of an arrangement of a stereo image sensor, a sphere, and a block at time t+1.

FIG. 16C illustrates a predicted pose of a stereo image sensor.

FIGS. 17A-17B illustrate exemplary images at time t.

FIGS. 17C-17D illustrate exemplary images at time t+1.

FIG. 18 illustrates an exemplary depth image at time t.

FIGS. 19A-19B illustrate exemplary images and predicted images at time t+1.

FIG. 20 illustrates a method for estimating an ego-motion.

FIG. 21 illustrates a system for refining a depth image.

FIG. 22A illustrates an exemplary depth image at time t.

FIG. 22B illustrates an exemplary depth image at time t+1.

FIGS. 23A-23B illustrate exemplary images at time t and t+1, and predicted images at time t and t+1.

FIG. 24 illustrates an exemplary depth image and a predicted depth image.

FIG. 25 illustrates a method for refining a depth image.

DETAILED DESCRIPTION

FIG. 1 illustrates vehicle 10 configured to detect a moving object using adjusted optical flow. Vehicle 10 may include one or more of hardware-implemented processor 11, image sensor 12, motion and orientation sensor 13, electronic storage 14, motor 15, locomotion mechanism 16, interface 17 (e.g., bus, wireless interface, etc.), and/or other components. To detect a moving object, two images taken at different times using image sensor 12 may be compared with each other to determine optical flow between the two images. The determined optical flow may be adjusted to account for any change in the field of view of image sensor 12 between the two images. By way of non-limiting example, the field of view of image sensor 12 when the first image was taken may be different from the field of view of image sensor 12 when the second image was taken due to movement of vehicle 10. The presence of the moving object may be detected based on the adjusted optical flow. The detection of the moving object may allow vehicle 10 to avoid or follow the moving object. The object type of the moving object may be identified, and one or more operating behaviors of vehicle 10 may be effectuated based on the identified object type.

Image sensor 12 may be configured to generate visual output signals conveying visual information within the field of view of image sensor 12. Visual information may include one or more of an image, a video, and/or other visual information. Image sensor 12 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. In some implementations, image sensor 12 may be located remotely from vehicle 10. In some implementations, image sensor 12 may be located on or in vehicle 10. In some implementations, hardware-implemented processor 11, image sensor 12, and motion and orientation sensor 13 may be carried (e.g., attached to, supported, held, disposed on, and/or otherwise carried) by vehicle 10 and the field of view of image sensor 12 may be a function of the position and orientation of vehicle 10.

Motion and orientation sensor 13 may be configured to generate output signals conveying motion information and orientation information of image sensor 12. Motion information may include information regarding one or more of the speed of image sensor 12, the distance traveled by image sensor 12, and/or the movement of image sensor 12, including one or more of moving forward, moving backwards, moving right, moving left, moving up, moving down, other movements, and/or other motion information. Orientation information may include information regarding the orientation of image sensor 12, including one or more of turning right, turning left, rolling right, rolling left, pitching up, pitching down, and/or other orientation information. Motion information and/or orientation information may be processed to obtain motions and/or orientations of image sensor 12 at particular times and/or locations. Motion and orientation sensor 13 may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

Electronic storage 14 may include electronic storage media that electronically stores information. Electronic storage 14 may store software algorithms, information determined by hardware-implemented processor 11, information received remotely, and/or other information that enables vehicle 10 to function properly. For example, electronic storage 14 may store visual information (as discussed elsewhere herein), and/or other information.

Motor 15 may be configured to drive locomotion mechanism 16 to effectuate movement of vehicle 10 along any direction. Locomotion mechanism 16 may include one or more components to effectuate movement of vehicle 10. By way of non-limiting example, locomotion mechanism 16 may include one or more of a wheel, a rotor, a leg, a continuous track, and/or other locomotion mechanisms.

Hardware-implemented processor 11 may be configured to provide information processing capabilities in vehicle 10. Hardware-implemented processor 11 may include one or more computing/processing devices with one or more algorithms/logics implemented in hardware. Algorithms/logics may be implemented in hardware via machine-readable instructions (e.g., a hardware description language file, a netlist, etc.). As a non-limiting example, a hardware-implemented processor may include one or more of a field-programmable gate array, an application-specific integrated circuit, and/or other hardware-implemented processors. In some implementations, hardware-implemented processor 11 may include a plurality of processing units. In some implementations, hardware-implemented processor 11 may be coupled with one or more of RAM, ROM, input/output ports, and/or other peripherals.

Hardware-implemented processor 11 may be configured by machine-readable instructions to execute one or more function components. The function components may include one or more of imaging component 20, predicted change component 21, predicted imaging component 22, actual change component 23, optical flow component 24, optical flow adjustment component 25, detection component 26, behavior component 27, and/or other function components.

Imaging component 20 may be configured to obtain images. The images may be determined based on visual output signals conveying visual information. The images may be determined by one or more of imaging component 20, image sensor 12, a computing/processing device coupled to image sensor 12, and/or other components. Visual output signals conveying visual information may be generated by image sensor 12. Imaging component 20 may be configured to obtain an image such that the visual output signals used to determine the image are generated by image sensor 12 at time t. Imaging component 20 may be configured to obtain another image such that the visual output signals used to determine the other image are generated by image sensor 12 at time t+1, which is subsequent to time t. Imaging component 20 may rectify and undistort the images.

For example, image sensor 12 may include one or more charge-coupled device sensors. Image sensor 12 may generate visual output signals conveying visual information within the field of view of image sensor 12 at time t by sequentially measuring the charges of individual rows of pixels (e.g., capacitors) of image sensor 12, where the charges are stored in image sensor 12 at time t. For example, image sensor 12 may include n rows of pixels. Visual output signals conveying visual information may be generated by measuring charges of the first row of pixels, then measuring charges of the second row of pixels, and continuing on until charges of all rows of pixels are measured, where charges are stored in the pixels of image sensor 12 at time t. Other types of image sensors and other methods of determining images are contemplated.

The field of view of image sensor 12 may change between time t and time t+1. For example, movement of vehicle 10 may change the field of view of image sensor 12.

FIGS. 2-6 illustrate a non-limiting example of a system and a method described herein to detect a moving object using adjusted optical flow. FIG. 2A illustrates a top-down view at time t of image sensor 12 and of sphere 40 and block 50, which are both within the field of view of image sensor 12. Straight-ahead line 30 indicates the middle of the field of view of image sensor 12. Imaging component 20 may determine image at t 60, illustrated in FIG. 2B, based on visual output signals conveying visual information, which may be generated by image sensor 12 at time t.

FIG. 3A illustrates a top-down view at time t+1 of image sensor 12, sphere 40, and block 50. Compared to time t, image sensor 12 and block 50 have both moved to the right by distance d while sphere 40 has remained stationary. Imaging component 20 may determine image at t+1 61, illustrated in FIG. 3B, based on visual output signals conveying visual information, which may be generated by image sensor 12 at time t+1. In image at t+1 61, block 50 appears in the same location relative to block 50 in image at t 60, while sphere 40 appears to the left by distance d relative to sphere 40 in image at t 60.

Predicted change component 21 may be configured to obtain predicted changes in the field of view of image sensor 12 between time t and time t+1. The predicted changes in the field of view of image sensor 12 between time t and time t+1 may be determined based on motion and orientation output signals conveying motion information and orientation information of image sensor 12. The predicted changes in the field of view of image sensor 12 may be determined by a companion processor and/or other components. A companion processor may refer to a computing/processing device that may provide information processing capabilities in conjunction with hardware-implemented processor 11. A companion processor may comprise one or more of a digital processor, an analog processor, a digital circuit designed to process information, a central processing unit, a graphics processing unit, a microcontroller, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information. A companion processor may be carried by vehicle 10 or located remotely from vehicle 10. For example, a companion processor may be located remotely from vehicle 10 and may communicate with vehicle 10/hardware-implemented processor 11 via a wireless communication link.

Motion and orientation output signals conveying motion information and orientation information may be generated by motion and orientation sensor 13. Motion information of image sensor 12 may include information regarding one or more of the speed of image sensor 12, the distance traveled by image sensor 12, and/or the movement of image sensor 12, including one or more of moving forward, moving backwards, moving right, moving left, moving up, moving down, other movements, and/or other motion information. Orientation information of image sensor 12 may include information regarding the orientation of image sensor 12, including one or more of turning right, turning left, rolling right, rolling left, pitching up, pitching down, and/or other orientation information.

For example, a companion processor may process motion information and orientation information of image sensor 12 shown in FIGS. 2A and 3A and predict that the field of view of image sensor 12 has moved to the right by distance d between time t and time t+1. The companion processor may communicate the predicted changes in the field of view of image sensor 12 (e.g., change in pose (position and orientation) of image sensor 12) to hardware-implemented processor 11.

Predicted imaging component 22 may be configured to determine a predicted image at time t. The predicted image at time t may include a prediction of how objects within the field of view of image sensor 12 at time t+1 may have looked if image sensor 12 had not moved between time t and time t+1. The predicted image at time t may be determined by adjusting an image determined at time t+1 based on the predicted change in the field of view of image sensor 12 between time t and time t+1. Predicted imaging component 22 may determine a predicted image via inverse composition warping and/or other operations.

For example, predicted imaging component 22 may be configured to determine predicted image at t 62, illustrated in FIG. 4, by adjusting image at t+1 61, illustrated in FIG. 3B. To account for the predicted movement to the right by distance d of the field of view of image sensor 12, predicted imaging component 22 may shift both sphere 40 and block 50 in image at t+1 61 by distance d to the right. Predicted image at t 62 may include a range of predictions regarding how sphere 40 and block 50 may have looked to image sensor 12 at time t. For example, ranges of predictions for sphere 40 and block 50 at time t are shown in FIG. 4 as predicted sphere 41 and predicted block 51.
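By way of non-limiting illustration, for the pure horizontal translation in this example, the predicted image at time t may be formed by warping the image at time t+1. In the following Python sketch, `d_pixels` (the predicted shift expressed in pixels) is a hypothetical input; a full implementation would warp with the complete predicted pose change rather than a single translation.

```python
# Minimal sketch: undo a predicted rightward field-of-view shift by
# shifting the image content right by the same amount.
import cv2
import numpy as np


def predict_image_at_t(image_t1, d_pixels):
    h, w = image_t1.shape[:2]
    translation = np.float32([[1, 0, d_pixels],
                              [0, 1, 0]])
    return cv2.warpAffine(image_t1, translation, (w, h))
```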

Actual change component 23 may be configured to obtain actual changes in the field of view of image sensor 12 between time t and time t+1. The actual change in the field of view of image sensor 12 between time t and time t+1 may be determined based on a comparison of an image determined at time t and a predicted image at time t. The actual changes in the field of view of image sensor 12 may be determined by one or more of hardware-implemented processor 11, a companion processor, and/or other components. By way of non-limiting example, comparison of two images may be made by comparing intensities of pixels of the image determined at time t and intensities of pixels of the predicted image at time t. Actual changes in the field of view of image sensor 12 may be determined by determining the positions and orientations of image sensor 12 that minimize the sum of squared differences of the pixel intensities.

Actual change component 23 may perform the comparison of pixels to evaluate the differences between intensities of pixels between the images. The differences between the intensities of pixels between the images may be used to adjust the predicted changes in the field of view of image sensor 12 to reduce the differences between the intensities of pixels between the images. The adjustments to the predicted changes in the field of view of image sensor 12 may be determined through processing and communication between hardware-implemented processor 11 and a companion processor.

For example, actual change component 23 may provide information regarding the differences between the intensities of pixels of the images (e.g., provided in a set of 6×6 linear equations) to the companion processor. The companion processor may use the information (e.g., invert the set of linear equations) to determine changes to the predicted change in the field of view of image sensor 12 (e.g., a new delta added to the 6 degrees of freedom pose estimate). The companion processor may provide the resulting predicted change in the field of view of image sensor 12 (e.g., a new pose estimate) to hardware-implemented processor 11. Predicted imaging component 22 may determine a new predicted image based on the new predicted change in the field of view of image sensor 12, and the comparison of the images may be repeated to refine pose (position and orientation) estimates. The actual changes in the field of view of image sensor 12 may be obtained when the differences between intensities of pixels between the images are below a threshold. In some implementations, the companion processor may process the actual changes in the field of view of image sensor 12 with an Extended Kalman filter to obtain position, orientation, velocity, and/or other movement information of image sensor 12.
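By way of non-limiting illustration, the refinement loop may be reduced to a one-dimensional search for this example's horizontal shift. The following Python sketch minimizes the mean squared pixel-intensity difference over small corrections to the predicted shift; it stands in for, and is far simpler than, the full six-degree-of-freedom solve over the 6×6 linear system described above.

```python
# Minimal sketch: brute-force search for the shift that best aligns the
# image at time t with a shifted image at time t+1.
import numpy as np


def refine_shift(image_t, image_t1, predicted_shift, search=5):
    best_shift, best_error = predicted_shift, np.inf
    for delta in range(-search, search + 1):
        shift = predicted_shift + delta
        predicted_t = np.roll(image_t1, shift, axis=1)
        # Exclude columns that wrapped around the image border.
        valid = slice(abs(shift), image_t.shape[1] - abs(shift))
        diff = (image_t[:, valid].astype(np.float32)
                - predicted_t[:, valid].astype(np.float32))
        error = np.mean(diff ** 2)  # mean, so different overlaps compare fairly
        if error < best_error:
            best_shift, best_error = shift, error
    return best_shift
```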

In some implementations, comparison of the images may be accomplished through an iterative process in which image resolutions may be changed. For example, the comparison may begin with an initial comparison that uses down-sampled versions of the images. By way of non-limiting example, the images may be down-sampled by a factor of eight. After initial positions and orientations of image sensor 12 are determined from the initial comparison of images that have been down-sampled by a factor of eight, the images may be compared again using images that have been down-sampled by a factor of four, and the process may continue until acceptable positions and orientations of image sensor 12 are determined or comparison of up-sampled images does not provide further information.
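
A minimal sketch of that coarse-to-fine loop follows. It assumes a hypothetical single-level routine, refine_pose, that performs the sum-of-squared-differences minimization described above at one resolution; only the pyramid scheduling is shown.

```python
import cv2

def coarse_to_fine_pose(image_t, image_t1, initial_pose, refine_pose):
    # Start with images down-sampled by 8, then 4, 2, and finally full
    # resolution, refining the pose estimate at each level.
    pose = initial_pose
    for factor in (8, 4, 2, 1):
        scale = 1.0 / factor
        small_t = cv2.resize(image_t, None, fx=scale, fy=scale)
        small_t1 = cv2.resize(image_t1, None, fx=scale, fy=scale)
        pose = refine_pose(small_t, small_t1, pose, scale)
    return pose
```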

In some implementations, determination of the actual change in the field of view may include, when comparing an image determined at time t and a predicted image at time t, disregarding portions of the image determined at time t and the predicted image at time t. Portions to be disregarded may be determined by using the Laplacian of Gaussian convolution. By way of non-limiting example, portions of the images may be disregarded based on an error in disparity of the movement of image sensor 12, or movement of objects within the field of view of image sensor 12. For example, portions of image at t 60 corresponding to block 50 and predicted image at t 62 corresponding to predicted block 51 may be disregarded because comparison of the images indicates that block 50 has moved between time t and time t+1, and the corresponding locations do not assist in determining the actual changes in the field of view of image sensor 12.
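
One plausible reading of the Laplacian-of-Gaussian step is a texture mask: pixels with a weak response carry little gradient information and can be disregarded during the comparison. A sketch under that assumption (the sigma and threshold values are illustrative, not from this disclosure):

```python
import numpy as np
from scipy.ndimage import gaussian_laplace

def comparison_mask(image, sigma=2.0, rel_threshold=0.05):
    # Keep only pixels whose Laplacian-of-Gaussian response is strong;
    # flat regions contribute little to the pose estimate and are dropped.
    response = np.abs(gaussian_laplace(image.astype(np.float32), sigma))
    return response > rel_threshold * response.max()
```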

Optical flow component 24 may be configured to obtain optical flow between an image determined at time t and an image determined at time t+1. For example, FIG. 5 illustrates optical flow 70 between image at t 60 and image at t+1 61. Optical flow 70 may show an apparent motion of a spherical object to the left. The optical flow between the two images may be determined by one or more of hardware-implemented processor 11, a companion processor, and/or other components.

Optical flow may be determined using a variety of methods, including, but not limited to, one or more of phase correlation, block-based methods, differential methods (such as the Lucas-Kanade method, Horn-Schunck method, Buxton-Buxton method, Black-Jepson method, and/or general variational methods), discrete optimization methods, and/or other methods. In some implementations, determining optical flow between an image determined at time t and an image determined at time t+1 may include determining individual optical flow vectors between the images. The optical flow vectors may be determined for an array of locations, pixel by pixel, or based on a group of pixels.
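
For instance, a dense flow field between the two frames can be computed with OpenCV's Farnebäck method, one member of the family of methods listed above; the parameter values below are illustrative defaults, not values from this disclosure.

```python
import cv2

def dense_optical_flow(image_t, image_t1):
    # Returns an HxWx2 array of per-pixel (dx, dy) flow vectors
    # from the grayscale frame at time t to the frame at time t+1.
    return cv2.calcOpticalFlowFarneback(
        image_t, image_t1, None,
        pyr_scale=0.5, levels=3, winsize=15,
        iterations=3, poly_n=5, poly_sigma=1.2, flags=0)
```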

Optical flow adjustment component 25 may be configured to obtain adjusted optical flow. The adjusted optical flow may be determined based on the actual change in the field of view. The adjusted optical flow may be determined by one or more of hardware-implemented processor 11, a companion processor, and/or other components. The adjusted optical flow may be determined by adjusting the optical flow to account for actual changes in the field of view of image sensor 12.

In some implementations, adjusting optical flow to account for actual changes in the field of view may include adjusting individual optical flow vectors to offset the actual change in the field of view. For example, optical flow vectors determined between two images may be offset to account for actual changes in the field of view. Such adjustment may reduce optical flow arising from changes in the field of view of image sensor 12. For example, optical flow adjustment component 25 (and/or a companion processor) may adjust optical flow 70, as illustrated in FIG. 5, to account for actual changes in the field of view of image sensor 12 between time t and time t+1. As illustrated in FIG. 6, optical flow adjustment component 25 (and/or a companion processor) may remove optical flow showing an apparent motion of a spherical object to the left as being caused by the field of view of image sensor 12 moving to the right by distance d between time t and time t+1. Optical flow adjustment component 25 (and/or a companion processor) may add optical flow showing a motion of a square object to the right between time t and time t+1.
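
In vector terms the adjustment is a subtraction: remove from each measured flow vector the flow that the sensor's own motion would have produced at that pixel. A minimal sketch, assuming the ego-motion flow field has already been rendered from the actual change in the field of view:

```python
import numpy as np

def adjust_optical_flow(flow, ego_flow):
    # Both arguments are HxWx2 flow fields; subtracting the flow induced
    # by the sensor's motion leaves mostly the flow of objects that
    # actually moved in the scene.
    return flow - ego_flow

# FIG. 5/6 scenario: a field of view moving right by d pixels makes
# static objects appear to move left by d, so (for pure translation)
# the ego-flow is (-d, 0) everywhere; subtracting it cancels the
# spherical object's apparent motion while keeping the square's.
```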

Detection component 26 may be configured to obtain the presence of a moving object. The presence of the moving object may be determined by one or more of hardware-implemented processor 11, a companion processor, and/or other components. The presence of the moving object may be detected based on adjusted optical flow. For example, detection component 26 (and/or a companion processor) may detect the presence of a moving object based on one or more qualities of adjusted optical flow, including one or more of magnitude, direction, and/or other qualities of adjusted optical flow. By way of non-limiting example, detection component 26 (and/or a companion processor) may segment different portions of adjusted optical flow into sections corresponding to different objects based on discontinuities in magnitude, direction, and/or other qualities of adjusted optical flow. For example, detection component 26 (and/or a companion processor) may segment a square section of adjusted optical flow 71 based on the magnitude of adjusted optical flow and/or direction of adjusted optical flow to the right.

Detection component 26 (and/or a companion processor) may set a minimum threshold value for magnitude, direction, and/or other qualities of adjusted optical flow that must be exceeded to be considered a potential moving object. Such use of a minimum threshold value may reduce false detection of a moving object based on noise and/or error in adjusted optical flow.

In some implementations, detection component 26 (and/or a companion processor) may be configured to identify clusters of adjusted optical flow vectors. Detection component 26 (and/or a companion processor) may identify clusters of adjusted optical flow vectors based on same/similar magnitude, same/similar direction, and/or other same/similar qualities of adjusted optical flow vectors. For example, detection component 26 (and/or a companion processor) may identify a cluster of adjusted optical flow if the direction of optical flow within the cluster does not deviate from a direction by more than a certain number of degrees. As another example, detection component 26 (and/or a companion processor) may identify a cluster of adjusted optical flow if the magnitude of optical flow within the cluster is within a certain magnitude range. As another example, detection component 26 (and/or a companion processor) may identify a cluster of adjusted optical flow if the direction of optical flow within the cluster does not deviate from a direction by more than a certain number of degrees and the magnitude of optical flow within the cluster is within a certain magnitude range.
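
A toy version of such clustering, sketched under stated assumptions: connected regions of the magnitude mask stand in for clusters, the direction-consistency test uses a simple mean (ignoring wraparound at ±180 degrees), and the thresholds are illustrative only.

```python
import numpy as np
from scipy import ndimage

def flow_clusters(adjusted_flow, min_magnitude=1.0, max_dev_deg=15.0):
    # Label connected regions of sufficiently large adjusted flow, then
    # discard regions whose flow directions disagree by more than
    # max_dev_deg from the region's mean direction.
    mag = np.linalg.norm(adjusted_flow, axis=2)
    ang = np.degrees(np.arctan2(adjusted_flow[..., 1],
                                adjusted_flow[..., 0]))
    labels, count = ndimage.label(mag > min_magnitude)
    for lbl in range(1, count + 1):
        region = labels == lbl
        if np.abs(ang[region] - ang[region].mean()).max() > max_dev_deg:
            labels[region] = 0
    return labels  # each positive label is a candidate moving object
```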

In some implementations, detection component 26 (and/or a companion processor) may detect the presence of a moving object through identification of a cluster of adjusted optical flow vectors. For example, detection component 26 (and/or a companion processor) may identify multiple clusters of adjusted optical flow vectors and may detect a moving object for one or more clusters of adjusted optical flow vectors. As another example, detection component 26 (and/or a companion processor) may identify a single object based on multiple clusters of adjusted optical flow vectors.

In some implementations, detection component 26 (and/or a companion processor) may be configured to identify an object type of the moving object based on the adjusted optical flow. An object type may refer to one or more categories of objects that have one or more common characteristics. A common characteristic may refer to a permanent or temporary feature of an object relating to the shape, motion, behavior, and/or other features of the object. Common characteristics may pertain to an entire object, or one or more portions of the object. Non-limiting examples of object types include a person, one or more parts of a person (e.g., arm, hand, head, body, leg, feet, etc.), an animal, one or more parts of an animal, a vehicle, one or more parts of a vehicle (e.g., wheel, door, engine, trunk, window, wing, propeller, rotor, etc.), and other object types.

Detection component 26 (and/or a companion processor) may identify an object type by matching the adjusted optical flow to an object type. For example, detection component 26 (and/or a companion processor) may identify an object type by matching the adjusted optical flow to one or more of a shape, a motion, a behavior, and/or other features of the object type. Object types may be programmed into detection component 26 (and/or a companion processor), updated by detection component 26 (and/or a companion processor), obtained by detection component 26 (and/or a companion processor) from electronic storage 14, obtained by detection component 26 (and/or a companion processor) from a remote location, and/or obtained by detection component 26 (and/or a companion processor) in other ways.

In some implementations, identification of an object type may be accomplished through an iterative process. For example, adjusted optical flow may be determined for a moving object at different times, and the object type of the moving object may be refined or changed based on subsequently determined adjusted optical flow. By way of non-limiting example, detection component 26 (and/or a companion processor) may initially identify the object type of a moving object as a vehicle based on initial optical flow, and subsequently refine the object type as a motorcycle based on subsequently determined adjusted optical flow.

In some implementations, behavior component 27 may be configured to effectuate one or more operating behaviors of a vehicle based on an identified object type. Operating behavior may refer to one or more motions and/or operations of a vehicle and/or one or more components of a vehicle. Motion of a vehicle and/or one or more components of a vehicle may refer to motion of the vehicle/component(s) at a time, motion of the vehicle/component(s) over a period of time, motion of the vehicle/component(s) at a location, and/or motion of the vehicle/component(s) over a distance. Operation of a vehicle and/or one or more components of a vehicle may refer to operation of the vehicle/component(s) at a time, operation of the vehicle/component(s) over a period of time, operation of the vehicle/component(s) at a location, and/or operation of the vehicle/component(s) over a distance. Operating behavior may be programmed into behavior component 27, updated by behavior component 27, obtained by behavior component 27 from electronic storage 14, obtained by behavior component 27 from a remote location, and/or obtained by behavior component 27 in other ways.

In some implementations, hardware-implemented processor 11 and image sensor 12 may be carried on or in vehicle 10, and hardware-implemented processor 11 may execute behavior component 27 to effectuate one or more operating behaviors of vehicle 10. By way of non-limiting example, vehicle 10 may include an unmanned aerial vehicle, hardware-implemented processor 11 may include one or more processing units of the unmanned aerial vehicle, and image sensor 12 may include one or more image sensors of the unmanned aerial vehicle. One or more processing units of the unmanned aerial vehicle may execute behavior component 27 to effectuate one or more operating behaviors of the unmanned aerial vehicle.

In some implementations, hardware-implemented processor 11 and image sensor 12 may be remotely located from vehicle 10, and hardware-implemented processor 11 remotely located from vehicle 10 may execute behavior component 27 to effectuate one or more operating behaviors of vehicle 10. By way of non-limiting example, vehicle 10 may include an unmanned aerial vehicle, hardware-implemented processor 11 may include one or more processing units coupled to a moving camera, such as a security camera, a moving vehicle, or another unmanned aerial vehicle, and image sensor 12 may be one or more image sensors of the moving camera. One or more components of the moving camera may wirelessly communicate with one or more components of the unmanned aerial vehicle. One or more processing units of the moving camera may execute behavior component 27 to effectuate one or more operating behaviors of the unmanned aerial vehicle. For example, an unmanned aerial vehicle may detect a moving person within the field of view of its image sensor(s), and the unmanned aerial vehicle may effectuate one or more operating behaviors of another unmanned aerial vehicle within the vicinity of the moving person.

In some implementations, hardware-implemented processor 11 may be carried on or in vehicle 10 and image sensor 12 may be remotely located from vehicle 10. By way of non-limiting example, vehicle 10 may include an unmanned aerial vehicle, and hardware-implemented processor 11 may include one or more processing units of the unmanned aerial vehicle. Image sensor 12 may include one or more image sensors of a moving camera, such as a security camera, a moving vehicle, or another unmanned aerial vehicle. One or more components of the moving camera may wirelessly communicate with one or more components of the unmanned aerial vehicle. One or more processing units of the unmanned aerial vehicle may execute behavior component 27 to effectuate one or more operating behaviors of the unmanned aerial vehicle. For example, a security camera may detect a moving person within the field of view of its image sensor(s), and wirelessly communicate with an unmanned aerial vehicle within the vicinity of the moving person. One or more processing units of the unmanned aerial vehicle may effectuate one or more operating behaviors of the unmanned aerial vehicle based on the security camera's detection of the moving person.

In some implementations, one or more operating behaviors of a vehicle may include the vehicle maintaining a minimum distance from the moving object and/or a maximum distance from the moving object. A minimum distance may refer to one or both of horizontal distance and/or vertical distance that the vehicle must keep between the vehicle and the moving object. For example, vehicle 10 may be configured to avoid a moving object by maintaining a certain horizontal distance and/or a certain vertical distance from the moving object. A maximum distance may refer to one or both of horizontal distance and/or vertical distance from the moving object within which vehicle 10 must stay. For example, vehicle 10 may be configured to follow a moving object by staying within a certain horizontal distance and/or a certain vertical distance around the moving object. A vehicle may be configured to maintain a set distance from the moving object by setting the same distance for the minimum distance and the maximum distance.

In some implementations, one or more operating behaviors of a vehicle may include the vehicle maintaining a minimum speed and/or a maximum speed. A minimum speed may refer to one or both of linear speed and/or angular speed that the vehicle should not drop below. A maximum speed may refer to one or both of linear speed and/or angular speed that the vehicle should not exceed. A vehicle may be configured to maintain a set speed in the vicinity of the moving object by setting the same speed for the minimum speed and the maximum speed.

In some implementations, one or more operating behaviors of a vehicle may include the vehicle maintaining the moving object within the field of view of the image sensor. Such operation may include maintaining and/or changing one or more of the vehicle position, height, speed, direction, path, and/or other operation of the vehicle. For example, vehicle 10 may detect a moving person and may operate to keep the person within the field of view of its image sensor 12 by maintaining and/or changing one or more of its position, height, speed, direction, path, and/or other operations.

In some implementations, one or more operating behaviors of a vehicle may include the vehicle facilitating wireless communication of the visual information to a remote computing device. For example, vehicle 10 may detect a moving person and wirelessly transmit output signals conveying visual information within the field of view of its image sensor 12. The visual information may include one or more of an image, a video, and/or other visual information regarding the moving person.

Referring to FIG. 8, spherical depth map system 800 for collision avoidance of an unmanned aerial vehicle may include one or more of processor 811, electronic storage 814, interface 817, and/or other components. In some implementations, spherical depth map system 800 may include one or more of stereo image sensor 812. Depth information for an environment around the unmanned aerial vehicle may be obtained. The depth information may characterize distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle. A spherical depth map may be generated from the depth information. A center of the spherical depth map may coincide with a location of the unmanned aerial vehicle. The spherical depth map may represent distances to closest surfaces of the environment around the unmanned aerial vehicle as a function of longitude angles and latitude angles. Maneuver controls for the unmanned aerial vehicle may be provided based on the spherical depth map.

Stereo image sensor 812 may be carried by the unmanned aerial vehicle. Stereo image sensor 812 may include first image sensor 831, second image sensor 832, and/or other components. First image sensor 831 may be configured to generate first visual output signals. The first visual output signals may convey visual information within a field of view of first image sensor 831. Visual information may include one or more of an image, a video, and/or other visual information. First image sensor 831 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. Second image sensor 832 may be configured to generate second visual output signals. The second visual output signals may convey visual information within a field of view of second image sensor 832. Second image sensor 832 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

Electronic storage 814 may include electronic storage media that electronically stores information. Electronic storage 814 may store software algorithms, information determined by processor 811, information received remotely, and/or other information that enables spherical depth map system 800 to function properly. For example, electronic storage 814 may store information relating to the spherical depth map (as discussed elsewhere herein), and/or other information.

Processor 811 may be configured to provide information processing capabilities in spherical depth map system 800. As such, processor 811 may comprise one or more of a digital processor, an analog processor, a digital circuit designed to process information, a central processing unit, a graphics processing unit, a microcontroller, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information. By way of non-limiting example, a microcontroller may include one or more of an 8051, PIC, AVR, ARM microcontroller, and/or other microcontrollers. In some implementations, processor 811 may include a plurality of processing units. In some implementations, processor 811 may be coupled with one or more of RAM, ROM, input/output ports, and/or other peripherals.

Processor 811 may be configured by machine-readable instructions to execute one or more computer program components. The computer program components may include one or more of depth information component 820, spherical depth map component 821, maneuver controls component 822, and/or other computer program components. In some implementations, processor 811 may include one or more of a hardware-implemented processor, a software-implemented processor, and/or other processors. A hardware-implemented processor may include one or more computing/processing devices with one or more algorithms/logics implemented in hardware to perform one or more functions. A software-implemented processor may include one or more computing/processing devices with one or more algorithms/logics implemented in software to perform one or more functions.

In some implementations, one or more hardware-implemented processors may be located remotely from one or more software-implemented processors. In some implementations, one or more hardware-implemented processors and one or more software-implemented processors may be arranged and used as described in U.S. Provisional Patent Application No. 62/203,754, entitled "SYSTEM AND METHOD FOR OBSTACLE DETECTION FOR MOBILE ROBOTS," filed on Aug. 11, 2015, the foregoing being incorporated herein by reference in its entirety.

For example, a hardware-implemented processor may be located within an unmanned aerial vehicle and a software-implemented processor (e.g., a companion processor, etc.) may be located remotely from the unmanned aerial vehicle. The hardware-implemented processor may communicate with the software-implemented processor via a wireless communication link (e.g., Gigabit Ethernet link, etc.). The hardware-implemented processor and the software-implemented processor may perform one or more functions of the computer program components and/or other functions. For example, the hardware-implemented processor may perform streaming-based tasks in real-time and share a light-weight polar-coordinate map (e.g., spherical depth map) to allow the software-implemented processor to fully process the data of stereo image sensor 812 and perform obstacle detection in real-time.

Depth information component 820 may be configured to obtain depth information for an environment around the unmanned aerial vehicle and/or other information. Depth information component 820 may obtain depth information from one or more sensors carried by the unmanned aerial vehicle, one or more sensors located remotely from the unmanned aerial vehicle, and/or from other locations. For example, depth information component 820 may obtain depth information from a distance sensor carried by the unmanned aerial vehicle, a distance sensor located remotely from the unmanned aerial vehicle, and/or from an electronic storage containing depth information for the environment around the unmanned aerial vehicle. The depth information may characterize one or more distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle.

FIGS. 9A-9B illustrate non-limiting examples of the environment around UAV 900 at one point in time. FIG. 9A provides a perspective view and FIG. 9B provides a side view of the environment around UAV 900. As shown in FIGS. 9A-9B, block arch 910 and cylinder 920 may be located in front of UAV 900. UAV 900 may be level with the bottom of block arch 910 and cylinder 920. Depth information component 820 may obtain depth information for the environment around UAV 900, including block arch 910 and cylinder 920. Depth information component 820 may obtain depth information at different longitude and latitude angles around UAV 900. For example, depth information component 820 may obtain depth information for surface A 912 of block arch 910 and for surface B 922 of cylinder 920, which correspond to a longitude angle of zero degrees (directly in front of UAV 900) and a latitude angle of α degrees.

The depth information may include one or more of distance information, disparity information, and/or other depth information. For example, depth information component 820 may obtain one or more of distance information, disparity information, and/or other depth information for surface A 912 and surface B 922. One or more of distance information, disparity information, and/or other depth information may be obtained from and/or through use of distance sensors. Distance sensors may include sensors that provide depth information for objects around the distance sensors. As non-limiting examples, distance sensors may include one or more of image sensors, infrared distance sensors, laser rangefinders, Lidar, ultrasonic distance sensors, range cameras, and/or other distance sensors. For example, depth information component 820 may be configured to compare the visual information from one image sensor with the visual information from another image sensor (e.g., image sensors of a stereo image sensor, etc.) to determine the depth information and/or other information. The depth information may be undistorted and rectified based on a relative position and orientation of the two image sensors.

Spherical depth map component 821 may be configured to generate a spherical depth map from the depth information and/or other information. FIG. 10 illustrates an exemplary spherical depth map 1000. The center of spherical depth map 1000 may coincide with a location of the unmanned aerial vehicle (e.g., UAV 900, etc.). The spherical depth map may represent distances to closest surfaces of the environment around the unmanned aerial vehicle as a function of longitude angles and latitude angles (e.g., around UAV 900, etc.). The spherical depth map may capture the local geometry of the unmanned aerial vehicle's environment efficiently.

In some implementations, the spherical depth map may be divided into map cells corresponding to longitude angles and latitude angles. For example, the surface of spherical depth map 1000 may be discretized into N rows, which correspond to θ ∈ [0, 2π], and M columns, which correspond to Φ ∈ [Φ′, π − Φ′].
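
A minimal sketch of that discretization follows, mapping a bearing (θ, Φ) to grid indices; the resolution and the Φ′ value are illustrative placeholders, not values from this disclosure.

```python
import numpy as np

N_ROWS, M_COLS = 200, 100          # illustrative map resolution
PHI_PRIME = np.radians(10.0)       # illustrative polar cut-off angle

def bearing_to_cell(theta, phi):
    # theta in [0, 2*pi) indexes rows; phi in [PHI_PRIME, pi - PHI_PRIME]
    # indexes columns, matching the discretization described above.
    row = int(theta / (2.0 * np.pi) * N_ROWS) % N_ROWS
    col = int((phi - PHI_PRIME) / (np.pi - 2.0 * PHI_PRIME) * M_COLS)
    return row, min(max(col, 0), M_COLS - 1)
```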

For example, spherical depth map 1000 may be divided into map cells by horizontal lines and vertical lines. Differing map resolutions may be used, such as 200×100, 200×500, or 1000×500. Other map resolutions are contemplated. Low resolution of the spherical depth map may allow for fast processing (e.g., inserting, updating, retrieval, filtering, etc.) of the depth information from the spherical depth map.

The division of the spherical depth map into map cells may allow depth information for a spherical environment around an unmanned aerial vehicle to be mapped to a fixed-size grid of discrete bearings. The map cells may include particular map cells corresponding to particular bearing angles (particular longitude angles and particular latitude angles). Individual map cells may include depth information for different bearing angles. For example, spherical depth map 1000 may include map cell A 1010, map cell B 1020, and/or other map cells. Map cell A 1010 and map cell B 1020 may correspond to different bearing angles. Map cell A 1010 may correspond to a bearing angle with a latitude angle of zero degrees (level with the unmanned aerial vehicle) and map cell B 1020 may correspond to a bearing angle with a latitude angle near the pole. Map cell A 1010 may be larger in size than map cell B 1020.

In some implementations, generating the spherical depth map may include reducing a three-dimensional map of a surrounding into the spherical depth map. Reducing a three-dimensional map of a surrounding into the spherical depth map may reduce the overall map data and allow for faster data insertion and/or retrieval. For example, to insert disparity data into a spherical depth map, the disparity data may be projected into 3D points. The projected disparity data may be transformed from the camera frame into the map frame. The disparity data in the map frame may be projected into a spherical coordinate system. In some implementations, disparity data may be inserted into a spherical depth map as described in U.S. Provisional Patent Application No. 62/203,754, entitled "SYSTEM AND METHOD FOR OBSTACLE DETECTION FOR MOBILE ROBOTS," filed on Aug. 11, 2015, incorporated supra.
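
A sketch of that insertion pipeline, reusing the bearing_to_cell helper and constants from the earlier sketch; Q (a 4×4 stereo reprojection matrix) and T_map_cam (a 4×4 camera-to-map transform) are assumed inputs, and the cell update keeps the closest distance, matching the closest-surface behavior described below.

```python
import numpy as np

def insert_disparity(depth_map, disparity, Q, T_map_cam):
    h, w = disparity.shape
    us, vs = np.meshgrid(np.arange(w), np.arange(h))
    valid = disparity > 0
    # Project disparity pixels to homogeneous 3D points (camera frame).
    pix = np.stack([us[valid], vs[valid], disparity[valid],
                    np.ones(valid.sum())])
    pts = Q @ pix
    pts = pts[:3] / pts[3]
    # Transform camera frame -> map frame.
    pts = (T_map_cam @ np.vstack([pts, np.ones(pts.shape[1])]))[:3]
    # Convert to spherical coordinates and keep the closest hit per cell.
    r = np.linalg.norm(pts, axis=0)
    theta = np.arctan2(pts[1], pts[0]) % (2.0 * np.pi)
    phi = np.arccos(pts[2] / np.maximum(r, 1e-9))
    for dist, th, ph in zip(r, theta, phi):
        if PHI_PRIME <= ph <= np.pi - PHI_PRIME:  # skip polar dead zones
            row, col = bearing_to_cell(th, ph)
            depth_map[row, col] = min(depth_map[row, col], dist)
    return depth_map
```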

In some implementations, generating the spherical depth map may include determining distance values for the map cells. The distance values may be determined based on the distances to the closest surfaces of the environment around the unmanned aerial vehicle and/or other information. A distance value for a particular map cell may be determined based on a distance to a closest surface of the environment around the unmanned aerial vehicle at a particular longitude angle and a particular latitude angle and/or other information. For example, as shown in FIGS. 9A-9B, a bearing from UAV 900 at a longitude angle of zero degrees (directly in front of UAV 900) and a latitude angle of α degrees may include two surfaces: surface A 912 and surface B 922. The distance value for this particular bearing angle may be determined based on the distance between UAV 900 and surface A 912.

In some implementations, the determination of distance values may disregard distances to surfaces of the environment around the unmanned aerial vehicle at individual longitude angles and individual latitude angles that are greater than the distances to the closest surfaces of the environment around the unmanned aerial vehicle at the individual longitude angles and the individual latitude angles. The distance values may correspond only to the closest surfaces of the environment around the unmanned aerial vehicle at individual longitude angles and individual latitude angles. For example, in FIGS. 9A-9B, the determination of the distance value for the bearing having a longitude angle of zero degrees (directly in front of UAV 900) and a latitude angle of α degrees may disregard the distance to surface B 922. Such a determination of distance values may allow for tracking of surfaces immediately surrounding an unmanned aerial vehicle without having to keep track of the entire three-dimensional space.

In some implementations, the spherical depth map may include one or more dead zones in one or more polar regions. The distance values may not be determined for the map cells in the dead zone(s). The density of map cells may become large near the polar regions of the spherical depth map. The density of map cells near/in the polar regions may be greater than needed for vehicle guidance. Not processing depth information for these regions may reduce memory/processing requirements. The amount of polar regions to remove from processing may be determined based on the size of the unmanned aerial vehicle. Smaller sizes of the unmanned aerial vehicle may correspond to smaller sizes of the removed polar regions. The polar regions may be defined by the angle Φ′.

In some implementations, spherical depth map component 821 may be configured to, responsive to detecting the unmanned aerial vehicle traveling a threshold distance, transform the spherical depth map. The spherical depth map may be transformed such that the center of the spherical depth map coincides with the location of the unmanned aerial vehicle. For example, UAV 900 may move from a position shown in FIGS. 9A-9B to a position shown in FIGS. 9C-9D. In FIGS. 9C-9D, UAV 900 may be underneath block arch 910. Spherical depth map component 821 may determine that UAV 900 has traveled from the position shown in FIGS. 9A-9B to the position shown in FIGS. 9C-9D (based on readings from a motion and orientation sensor, visual odometry, etc.). In response to the translational movement of UAV 900 exceeding a certain threshold, spherical depth map component 821 may transform the spherical depth map to keep the spherical depth map centered at UAV 900.
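
One way such a re-centering could work is sketched below: each finite cell is converted back to a 3D point, shifted by the vehicle's translation, and re-inserted into a fresh map. It reuses the constants and bearing_to_cell helper assumed in the earlier sketches; the threshold value is illustrative.

```python
import numpy as np

TRANSLATION_THRESHOLD = 0.5  # meters; illustrative minimal translation

def recenter_map(depth_map, translation):
    # Build a new map whose center sits at the vehicle's new location.
    new_map = np.full_like(depth_map, np.inf)
    for row in range(N_ROWS):
        for col in range(M_COLS):
            dist = depth_map[row, col]
            if not np.isfinite(dist):
                continue
            # Cell center back to a 3D point in the old map frame.
            theta = (row + 0.5) / N_ROWS * 2.0 * np.pi
            phi = PHI_PRIME + (col + 0.5) / M_COLS * (np.pi - 2 * PHI_PRIME)
            point = dist * np.array([np.cos(theta) * np.sin(phi),
                                     np.sin(theta) * np.sin(phi),
                                     np.cos(phi)])
            point -= translation       # express relative to the new center
            r = np.linalg.norm(point)
            th = np.arctan2(point[1], point[0]) % (2.0 * np.pi)
            ph = np.arccos(point[2] / max(r, 1e-9))
            if PHI_PRIME <= ph <= np.pi - PHI_PRIME:
                nr, nc = bearing_to_cell(th, ph)
                new_map[nr, nc] = min(new_map[nr, nc], r)
    return new_map
```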

Transformation of the spherical depth map may allow spherical depth map system 800 to keep track of distances that may not be currently observable. For example, a sensor of UAV 900 used to determine depth information for the environment surrounding UAV 900 may be limited to a degree above the horizontal. In FIG. 9B, UAV 900 may be able to determine depth information for surface A 912 of block arch 910. In FIG. 9D, UAV 900 may not be able to determine depth information for surface A 912 of block arch 910 because surface A 912 is outside the sensor range of UAV 900. UAV 900 relying solely on present sensor data to determine depth information may not detect surface A 912 and may not be able to provide maneuver controls (e.g., collision warning, keeping a safe distance away from surface A 912, indication of distance to surface A 912, etc.) for UAV 900 based on surface A 912. Transformation of the spherical depth map may allow UAV 900 to remember depth information for surface A 912 when it moves from the position shown in FIGS. 9A-9B to the position shown in FIGS. 9C-9D.

Transformation of the spherical depth map may introduce discretization errors due to the low angular resolution of the spherical depth map. To reduce the influence of these errors, the spherical depth map may be updated only after a certain minimal translation threshold is exceeded. Fast transformation of the spherical depth map may be performed by forward-propagating the last known position of UAV 900 using motion and orientation sensor measurements and a current motion and orientation sensor bias estimate. In some implementations, a spherical depth map may be transformed as described in U.S. Provisional Patent Application No. 62/203,754, entitled "SYSTEM AND METHOD FOR OBSTACLE DETECTION FOR MOBILE ROBOTS," filed on Aug. 11, 2015, incorporated supra.

Maneuver controls component 822 may be configured to provide maneuver controls for the unmanned aerial vehicle. The maneuver controls for the unmanned aerial vehicle may be based on the spherical depth map. The maneuver controls for the unmanned aerial vehicle may include providing additional information relating to the unmanned aerial vehicle (environment around the unmanned aerial vehicle) based on the spherical depth map (e.g., detecting closest objects in the environment around the unmanned aerial vehicle, determining distances to closest objects in the environment around the unmanned aerial vehicle, providing warnings about closest objects in the environment around the unmanned aerial vehicle, etc.), providing control limits for the unmanned aerial vehicle based on the spherical depth map (e.g., restricting one or more of speed, heading, or distance to closest objects in the environment around the unmanned aerial vehicle, etc.), and/or other maneuver controls. In some implementations, the maneuver controls for the unmanned aerial vehicle may include controlling the unmanned aerial vehicle to avoid one or more objects in the environment around the unmanned aerial vehicle. For example, in FIGS. 9C-9D, maneuver controls component 822 may provide information about surface A 912 and/or restrict UAV 900 from flying near/into surface A 912.

In some implementations, maneuver controls component 822 may provide for obstacle detection. In some implementations, obstacle detection may be performed as described in U.S. Provisional Patent Application No. 62/203,754, entitled "SYSTEM AND METHOD FOR OBSTACLE DETECTION FOR MOBILE ROBOTS," filed on Aug. 11, 2015, incorporated supra. Maneuver controls component 822 may extract all values in the spherical depth map which are closer than a certain threshold (e.g., extract distance values closer than a certain distance). In some implementations, maneuver controls component 822 may provide for obstacle detection by determining distances at given bearing angles. For example, UAV 900 in FIGS. 9C-9D may be programmed to rise up. Maneuver controls component 822 may determine distances to surrounding objects (e.g., surface A 912 of block arch 910) by checking the depth information stored in the map cells corresponding to the programmed flight path. Based on the retrieval of the distance to surface A 912, maneuver controls component 822 may detect surface A 912. In some implementations, information contained in a spherical depth map may be accessed as described in U.S. Provisional Patent Application No. 62/203,754, entitled "SYSTEM AND METHOD FOR OBSTACLE DETECTION FOR MOBILE ROBOTS," filed on Aug. 11, 2015, incorporated supra.
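
The threshold extraction described above reduces to a simple query over the map cells. A minimal sketch, assuming the map stores infinity in cells with no observed surface:

```python
import numpy as np

def extract_obstacles(depth_map, distance_threshold):
    # Every map cell holding a distance closer than the threshold is a
    # candidate obstacle; returns (row, col, distance) tuples.
    rows, cols = np.where(depth_map < distance_threshold)
    return [(r, c, float(depth_map[r, c])) for r, c in zip(rows, cols)]
```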

Referring to FIG. 12, predicted path system 1200 for collision avoidance of an unmanned aerial vehicle may include one or more of processor 1211, electronic storage 1214, interface 1217, and/or other components. In some implementations, predicted path system 1200 may include one or more of stereo image sensor 1212. In some implementations, predicted path system 1200 may include one or more of motion and orientation sensor 1213. A stationary object and/or a moving object in an environment around the unmanned aerial vehicle may be detected. Depth information for the environment around the unmanned aerial vehicle may be obtained. The depth information may characterize distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle, including the detected stationary object and/or the detected moving object. A depth map may be generated from the depth information. The accuracy of the depth map may be characterized by a depth map accuracy. Vehicle physical model information may be obtained. The vehicle physical model may characterize a motion of the unmanned aerial vehicle. The accuracy of the vehicle physical model information may be characterized by a vehicle physical model accuracy. A predicted vehicle path may be determined based on the vehicle physical model information and the vehicle physical model accuracy.

In response to detecting the stationary object, a potential collision of the unmanned aerial vehicle with the stationary object may be determined based on the depth map, the depth map accuracy, and the predicted vehicle path. In response to detecting the moving object, moving object physical model information may be obtained. The moving object physical model may characterize a motion of the moving object. The accuracy of the moving object physical model may be characterized by a moving object physical model accuracy. A predicted moving object path may be determined based on the depth map, the depth map accuracy, the moving object physical model information, and the moving object physical model accuracy. A potential collision of the unmanned aerial vehicle with the moving object may be determined based on the predicted vehicle path and the predicted moving object path. In response to determining a potential collision of the unmanned aerial vehicle, a velocity of the unmanned aerial vehicle may be changed.

Stereo image sensor 1212 may be carried by the unmanned aerial vehicle. Stereo image sensor 1212 may include first image sensor 1231, second image sensor 1232, and/or other components. First image sensor 1231 may be configured to generate first visual output signals. The first visual output signals may convey visual information within a field of view of first image sensor 1231. Visual information may include one or more of an image, a video, and/or other visual information. First image sensor 1231 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. Second image sensor 1232 may be configured to generate second visual output signals. The second visual output signals may convey visual information within a field of view of second image sensor 1232. Second image sensor 1232 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

Motion and orientation sensor 1213 may be configured to generate motion and orientation output signals. Motion and orientation output signals may convey motion information and orientation information of the unmanned aerial vehicle. Motion information may include one or more motion information regarding speed of the unmanned aerial vehicle, distance traveled by the unmanned aerial vehicle, and/or movement of the unmanned aerial vehicle, including one or more of moving forward, moving backwards, moving right, moving left, moving up, moving down, other movements, and/or other motion information. Orientation information may include one or more orientation information regarding orientation of the unmanned aerial vehicle, including one or more of turning right, turning left, rolling right, rolling left, pitching up, pitching down, and/or other orientation information. Motion information and/or orientation information may be processed to obtain motions and/or orientations of the unmanned aerial vehicle at particular times and/or locations. Motion and orientation sensor 1213 may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

Electronic storage 1214 may include electronic storage media that electronically stores information. Electronic storage 1214 may store software algorithms, information determined by processor 1211, information received remotely, and/or other information that enables predicted path system 1200 to function properly. For example, electronic storage 1214 may store depth information (as discussed elsewhere herein), and/or other information.

Processor 1211 may be configured to provide information processing capabilities in predicted path system 1200. As such, processor 1211 may comprise one or more of a digital processor, an analog processor, a digital circuit designed to process information, a central processing unit, a graphics processing unit, a microcontroller, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information. By way of non-limiting example, a microcontroller may include one or more of an 8051, PIC, AVR, ARM microcontroller, and/or other microcontrollers. In some implementations, processor 1211 may include a plurality of processing units. In some implementations, processor 1211 may be coupled with one or more of RAM, ROM, input/output ports, and/or other peripherals.

Processor 1211 may be configured by machine-readable instructions to execute one or more computer program components. The computer program components may include one or more of object detection component 1220, depth information component 1221, depth map component 1222, physical model information component 1223, predicted path component 1224, collision detection component 1225, collision avoidance component 1226, and/or other computer program components. In some implementations, processor 1211 may include one or more of a hardware-implemented processor, a software-implemented processor, and/or other processors. In some implementations, one or more hardware-implemented processors may be located remotely from one or more software-implemented processors.

Object detection component 1220 may be configured to detect one or more stationary objects in an environment around the unmanned aerial vehicle. Object detection component 1220 may be configured to detect one or more moving objects in the environment around the unmanned aerial vehicle. For example, FIG. 13A illustrates an exemplary environment around an unmanned aerial vehicle (UAV 1300). Object detection component 1220 may detect stationary object 1310, moving object 1320, and/or other objects.

In some implementations, object detection component 1220 may be configured to identify one or more object types of one or more detected objects. An object type may refer to one or more categories of objects that have one or more common characteristics. A common characteristic may refer to a permanent or temporary feature of an object relating to the shape, motion, behavior, and/or other features of the object. Common characteristics may pertain to an entire object, or one or more portions of the object. Non-limiting examples of object types include a person, a child, an adult, one or more parts of a person (e.g., arm, hand, head, body, leg, feet, etc.), an animal, a particular kind of animal, one or more parts of an animal, a vehicle, a particular kind of vehicle, one or more parts of a vehicle (e.g., wheel, door, engine, trunk, window, wing, propeller, rotor, etc.), a stationary object, one or more parts of a stationary object, and other object types.

Depth information component 1221 may be configured to obtain depth information for an environment around the unmanned aerial vehicle and/or other information. Depth information component 1221 may obtain depth information from one or more sensors carried by the unmanned aerial vehicle, one or more sensors located remotely from the unmanned aerial vehicle, and/or from other locations. For example, depth information component 1221 may obtain depth information from a distance sensor carried by the unmanned aerial vehicle, a distance sensor located remotely from the unmanned aerial vehicle, and/or from an electronic storage containing depth information for the environment around the unmanned aerial vehicle. The depth information may characterize one or more distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle. The environment around the unmanned aerial vehicle may include one or more detected stationary objects and/or one or more detected moving objects. For example, depth information component 1221 may obtain depth information characterizing one or more distances between UAV 1300 and stationary object 1310 and one or more distances between UAV 1300 and moving object 1320.

The depth information may include one or more of distance information, disparity information, and/or other depth information. For example, depth information component 1221 may obtain one or more of distance information, disparity information, and/or other depth information for stationary object 1310 and moving object 1320. One or more of distance information, disparity information, and/or other depth information may be obtained from and/or through use of distance sensors. Distance sensors may include sensors that provide depth information for objects around the distance sensors. As non-limiting examples, distance sensors may include one or more of image sensors, infrared distance sensors, laser rangefinders, Lidar, ultrasonic distance sensors, range cameras, and/or other distance sensors. For example, depth information component 1221 may be configured to compare the visual information from one image sensor (e.g., first image sensor 1231) with the visual information from another image sensor (e.g., second image sensor 1232) to determine the depth information and/or other information.

Depth map component 1222 may be configured to generate a depth map from the depth information and/or other information. The depth map may characterize distances between the unmanned aerial vehicle and one or more detected stationary objects and/or one or more detected moving objects. For example, depth map component 1222 may generate a depth map characterizing one or more distances between UAV 1300 and stationary object 1310 and one or more distances between UAV 1300 and moving object 1320. In some implementations, the depth map may include a statistical voxel-map, an Octomap, a spherical depth map, and/or other depth maps.

The accuracy of the depth map may be characterized by a depth map accuracy and/or other information. The depth map accuracy may be inherent in one or more of sensor(s) used to capture the depth information, algorithms used to capture/transform/store the depth information, algorithms used to create a depth map, and/or other processes/components used to capture depth information/create the depth map. The depth map accuracy may be obtained from electronic storage 1214 and/or obtained from other locations. The depth map accuracy may be calculated.

The depth map accuracy may provide information about accuracy of the distances between the unmanned aerial vehicle and the detected objects characterized by the depth map with respect to actual distances between the unmanned aerial vehicle and the detected objects. The depth map accuracy may provide information about accuracy of the distance between the unmanned aerial vehicle and one or more detected stationary objects characterized by the depth map with respect to actual distances between the unmanned aerial vehicle and one or more detected stationary objects. For example, the depth map accuracy may provide information about accuracy of the distance between UAV 1300 and stationary object 1310. Based on the depth map accuracy, stationary object 1310 may be determined to be located within a range of locations, indicated as stationary object accuracy range 1315.

The depth map accuracy may provide information about accuracy of the distance between the unmanned aerial vehicle and one or more detected moving objects characterized by the depth map with respect to actual distances between the unmanned aerial vehicle and one or more detected moving objects. For example, the depth map accuracy may provide information about accuracy of the distance between UAV 1300 and moving object 1320. Based on the depth map accuracy, moving object 1320 may be determined to be located within a range of locations, indicated as moving object accuracy range 1325.

Physical model information component 1223 may be configured to obtain vehicle physical model information and/or other information. The vehicle physical model information may characterize a motion of the unmanned aerial vehicle. For example, physical model information component 1223 may obtain vehicle physical model information for UAV 1300. The vehicle physical model information may be obtained from UAV 1300, obtained from one or more sensors measuring movement of UAV 1300, obtained from electronic storage 1214, and/or from other locations. The vehicle physical model information may be calculated.

In some implementations, the vehicle physical model information may be based on the motion and orientation information of the unmanned aerial vehicle and/or other information. In some implementations, the vehicle physical model information may be based on one or more of a weight of the unmanned aerial vehicle, a size of the unmanned aerial vehicle, a shape of the unmanned aerial vehicle, a linear speed of the unmanned aerial vehicle, a linear acceleration of the unmanned aerial vehicle, a linear direction of the unmanned aerial vehicle, an angular speed of the unmanned aerial vehicle, an angular acceleration of the unmanned aerial vehicle, an angular direction of the unmanned aerial vehicle, a motion instruction for the unmanned aerial vehicle, an environmental condition around the unmanned aerial vehicle, and/or other parameters. One or more of the parameters may be related to one or more other parameters. Physical model information component 1223 may include and/or retrieve information (for example, a database, etc.) relating to one or more of the parameters and/or one or more relationships between the parameters.

For example, physical model information component 1223 may obtain vehicle physical model information for UAV 1300. The vehicle physical model information for UAV 1300 may characterize motion of UAV 1300, shown as UAV motion 1350 in FIG. 13B. As shown in FIG. 13B, the vehicle physical model information for UAV 1300 may indicate that UAV 1300 will move forward by turning right and then turning left.

The accuracy of the vehicle physical model information may be characterized by a vehicle physical model accuracy and/or other information. The vehicle physical model accuracy may be inherent in one or more of the unmanned aerial vehicle, one or more parts of the unmanned aerial vehicle, algorithms used to control/move the unmanned aerial vehicle, environmental conditions around the unmanned aerial vehicle, and/or sensors used to track movement of the unmanned aerial vehicle and/or surrounding conditions. The vehicle physical model accuracy may be determined based on the identity of the unmanned aerial vehicle (e.g., model number, etc.) and/or other processes/components/factors (e.g., payload carried by the unmanned aerial vehicle, weather conditions, wind speed, etc.) impacting control/movement of the unmanned aerial vehicle. The vehicle physical model accuracy may be obtained from electronic storage 1214 and/or obtained from other locations. The vehicle physical model accuracy may be calculated.

The vehicle physical model accuracy may provide information about accuracy of the motion of the unmanned aerial vehicle characterized by the vehicle physical model information with respect to an actual motion of the unmanned aerial vehicle. For example, the vehicle physical model accuracy may provide information about accuracy of UAV motion 1350.

Physical model information component 1223 may be configured to, responsive to detection of one or more moving objects, obtain moving object physical model information and/or other information. The moving object physical model information may characterize one or more motions of one or more moving objects. For example, physical model information component 1223 may obtain moving object physical model information for moving object 1320. The moving object physical model information may be obtained from moving object 1320, obtained from one or more sensors measuring movement of moving object 1320, obtained from electronic storage 1214, and/or from other locations. The moving object physical model information may be calculated. In some implementations, the moving object physical model information may be based on one or more object types of one or more detected moving objects and/or other information. For example, the moving object physical model information for moving object 1320 may be based on the object type of moving object 1320.

For example, physical model information component 1223 may obtain moving object physical model information for moving object 1320. The moving object physical model information for moving object 1320 may characterize motion of moving object 1320, shown as moving object motion 1355 in FIG. 13B. As shown in FIG. 13B, the moving object physical model information for moving object 1320 may indicate that moving object 1320 will move diagonally to the left.

The accuracy of the moving object physical model information may be characterized by a moving object physical model accuracy and/or other information. The moving object physical model accuracy may be inherent in one or more of the moving object, one or more parts of the moving object, algorithms used to control/move the moving object, environmental conditions around the moving object, and/or sensors used to track movement of the moving object and/or surrounding conditions. The moving object physical model accuracy may be determined based on the identity of the moving object and/or other processes/components/factors impacting control/movement of the moving object. The moving object physical model accuracy may be obtained from electronic storage 1214 and/or obtained from other locations. The moving object physical model accuracy may be calculated.

The moving object physical model accuracy may provide information about accuracy of the motion of one or more moving objects characterized by the moving object physical model information with respect to the actual motion of one or more moving objects. For example, the moving object physical model accuracy may provide information about accuracy of moving object motion 1355.

Predicted path component 1224 may be configured to determine a predicted vehicle path. A predicted vehicle path may be determined based on the vehicle physical model information, the vehicle physical model accuracy, and/or other information. Predicted path component 1224 may determine a predicted vehicle path of an unmanned aerial vehicle by taking into account the vehicle physical model information (e.g., measured position, orientation, velocity of the unmanned aerial vehicle, etc.), the vehicle physical model accuracy (e.g., inaccuracies in position/orientation/velocity measurement, weather conditions, wind speeds, etc.), and/or other information. The predicted vehicle path may include predicted location(s) of the unmanned aerial vehicle at one or more future times.

The area encompassed by the predicted vehicle path of the unmanned aerial vehicle may increase with time. The increased area covered by the predicted vehicle path may account for one or more errors in determining motion of the unmanned aerial vehicle and provide for safety margins in navigating the unmanned aerial vehicle to a destination. For example, based on the vehicle physical model information and the vehicle physical model accuracy of UAV 1300, predicted path component 1224 may determine predicted UAV path 1360, shown in FIG. 13C. Predicted UAV path 1360 may cover an area greater than UAV motion 1350.
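
A minimal sketch of such a path with growing uncertainty, assuming a constant-velocity model; the function name, the growth_rate parameter, and the horizon values are illustrative, not from this disclosure.

```python
import numpy as np

def predicted_path(position, velocity, accuracy_radius, growth_rate,
                   horizon=5.0, dt=0.5):
    # Predict future positions and pair each with an uncertainty radius
    # that grows with time, so the area covered by the path widens.
    path = []
    for k in range(1, int(horizon / dt) + 1):
        t = k * dt
        center = position + velocity * t
        radius = accuracy_radius + growth_rate * t
        path.append((t, center, radius))
    return path
```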

Predicted path component 1224 may be configured to, responsive to detection of one or more moving objects, determine one or more predicted moving object paths. One or more predicted moving object paths may be determined based on the depth map, the depth map accuracy, the moving object physical model information, the moving object physical model accuracy, and/or other information. Predicted path component 1224 may determine a predicted moving object path of a moving object by taking into account the moving object physical model information (e.g., measured position, orientation, velocity of the moving object, etc.), the moving object physical model accuracy and/or the depth map accuracy (e.g., inaccuracies in position/orientation/velocity measurement, weather conditions, wind speeds, etc.), and/or other information. The predicted moving object path(s) may include predicted location(s) of the moving object(s) at one or more future times.

The area encompassed by the predicted moving object path of the moving object may increase with time. The increased area covered by the predicted moving object path may account for one or more errors in determining motion of the moving object and provide for safety margins in navigating the unmanned aerial vehicle to a destination. For example, based on the depth map, the depth map accuracy, the moving object physical model information, and the moving object physical model accuracy of moving object 1320, predicted path component 1224 may determine predicted moving object path 1365, shown in FIG. 13C. Predicted moving object path 1365 may cover an area greater than moving object motion 1355.

In some implementations, the predicted moving object path may be determined based on the object type of the moving object. The type of the moving object may indicate limits of the moving object and/or predictability/unpredictability of the moving object. For example, based on an identification of a moving object as a car, predicted path component 1224 may determine a predicted moving object path based on limits of car movement (e.g., cannot move sideways, cannot fly, etc.). As another example, based on an identification of a moving object as a child, predicted path component 1224 may determine a predicted moving object path based on the unpredictability of a child (e.g., a child may change directions abruptly without warning). For example, a predicted moving object path for a child may cover a greater area than a predicted moving object path for a Ferris wheel to provide for safety margins in navigating the unmanned aerial vehicle near a child. The predicted moving object path for a child may be limited based on a likely maximum speed of a child. Other types of prediction of predicted moving object paths based on the object type of the moving object are contemplated.
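
A hedged sketch of shaping a predicted path envelope by object type is given below. The disclosure states only that object type may bound speed and predictability; the specific table of limits here is hypothetical.

    # Hedged sketch: shaping a predicted moving object path by object type.
    # The type table (maximum speed, unpredictability factor) is entirely
    # hypothetical and exists only to illustrate the idea.

    OBJECT_LIMITS = {
        # type: (max_speed_m_s, unpredictability_factor)
        "car":          (30.0, 0.2),   # cannot move sideways or fly; fairly predictable
        "child":        (3.0,  1.0),   # slow, but may change direction abruptly
        "ferris_wheel": (2.0,  0.0),   # fixed, highly predictable trajectory
    }

    def predicted_reach_radius(object_type, horizon_s):
        """Radius of the area a moving object may reach within the horizon."""
        max_speed, unpredictability = OBJECT_LIMITS[object_type]
        # Base reach is bounded by the likely maximum speed of the object type;
        # the unpredictability factor widens the envelope as a safety margin.
        return max_speed * horizon_s * (1.0 + unpredictability)

    if __name__ == "__main__":
        for obj in OBJECT_LIMITS:
            print(obj, predicted_reach_radius(obj, horizon_s=2.0))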

Collision detection component 1225 may be configured to, responsive to detecting one or more stationary objects, determine whether the unmanned aerial vehicle moving in the predicted vehicle path may collide with one or more detected stationary objects. A collision between the unmanned aerial vehicle and one or more detected stationary objects may be determined based on the depth map, the depth map accuracy, the predicted vehicle path, and/or other information. One or more collisions may be determined at one or more locations and/or at one or more future times. For example, as shown in FIG. 13D, collision detection component 1225 may determine that UAV 1300 moving in predicted UAV path 1360 may collide with stationary object 1310 (shown as collision A 1370) based on the depth map, the depth map accuracy, predicted UAV path 1360, and/or other information.

Collision detection component 1225 may be configured to, responsive to detection of one or more moving objects, determine whether the unmanned aerial vehicle moving in the predicted vehicle path may collide with one or more detected moving objects. A collision between the unmanned aerial vehicle and one or more detected moving objects may be determined based on the predicted vehicle path, the predicted moving object path, and/or other information. One or more collisions may be determined at one or more locations and/or at one or more future times. For example, as shown in FIG. 13D, collision detection component 1225 may, responsive to detection of moving object 1320, determine that UAV 1300 moving in predicted UAV path 1360 may collide with moving object 1320 (shown as collision B 1375) based on predicted UAV path 1360, predicted moving object path 1365, and/or other information.
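
One plausible reading of the collision test is sketched below under stated assumptions: each path is reduced to a per-time-step circle (predicted location plus uncertainty radius), and a collision is flagged when the circles for the same future time overlap. The circle representation is an illustrative simplification, not the claimed method.

    # Minimal sketch of a collision test between a predicted vehicle path and
    # a predicted moving object path, both given as per-time-step circles.

    import math

    def detect_collisions(vehicle_path, object_path):
        """Each path: list of ((x, y), radius) indexed by future time step."""
        collisions = []
        for step, ((vx, vy), vr) in enumerate(vehicle_path):
            if step >= len(object_path):
                break
            (ox, oy), orad = object_path[step]
            # Overlapping uncertainty circles at the same step: possible collision.
            if math.hypot(vx - ox, vy - oy) <= vr + orad:
                collisions.append((step, (vx, vy)))
        return collisions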

Collision avoidance component 1226 may be configured to, responsive to determination of one or more collisions of the unmanned aerial vehicle moving in the predicted vehicle path, change a velocity of the unmanned aerial vehicle to avoid the one or more collisions. For example, in response to determining one or more collisions of UAV 1300 with stationary object 1310 and/or moving object 1320, collision avoidance component 1226 may change a velocity of UAV 1300.

In some implementations, changing the velocity of the unmanned aerial vehicle may include changing one or more speeds of the unmanned aerial vehicle. For example, as shown in FIG. 13D, the speed of UAV 1300 may be changed (slowed or increased) so that UAV 1300 passes the point marked as collision B 1375 at a different time than moving object 1320. The speed of UAV 1300 may be changed one or more times. For example, the speed of UAV 1300 may be slowed to avoid moving object 1320. Once UAV 1300 has moved past predicted position(s) of moving object 1320, the speed of UAV 1300 may be increased to the original speed or beyond the original speed to make up for the delay caused in avoiding moving object 1320. In some implementations, changing the velocity of the unmanned aerial vehicle may include changing one or more directions of the unmanned aerial vehicle to move the unmanned aerial vehicle in a path that deviates from the predicted vehicle path. For example, as shown in FIG. 13E, the direction of UAV 1300 may be changed so that UAV 1300 does not move through locations encompassed within stationary object accuracy range 1315. The direction of UAV 1300 may be changed one or more times. For example, the direction of UAV 1300 may be shifted to the left to avoid stationary object 1310. Once UAV 1300 has moved past stationary object accuracy range 1315, the direction of UAV 1300 may be changed to put UAV 1300 back on the original course. In some implementations, changing the velocity of the unmanned aerial vehicle may include use of Dijkstra's algorithm and/or other shortest-path algorithms.
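
Since Dijkstra's algorithm is named above as one option for computing a deviating path, a minimal grid-based sketch follows. The 4-connected occupancy grid and unit edge costs are assumptions; cells inside an object's accuracy range would be marked blocked.

    # Hedged sketch of the shortest-path option mentioned above: Dijkstra's
    # algorithm on a small occupancy grid.

    import heapq

    def dijkstra(grid, start, goal):
        """grid[y][x] == 1 marks a blocked cell; returns a list of (x, y) cells."""
        h, w = len(grid), len(grid[0])
        dist = {start: 0.0}
        prev = {}
        queue = [(0.0, start)]
        while queue:
            d, (x, y) = heapq.heappop(queue)
            if (x, y) == goal:
                break
            if d > dist.get((x, y), float("inf")):
                continue
            for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
                if 0 <= nx < w and 0 <= ny < h and grid[ny][nx] == 0:
                    nd = d + 1.0
                    if nd < dist.get((nx, ny), float("inf")):
                        dist[(nx, ny)] = nd
                        prev[(nx, ny)] = (x, y)
                        heapq.heappush(queue, (nd, (nx, ny)))
        if goal not in prev and goal != start:
            return []  # no collision-free path found
        # Walk back from the goal to reconstruct the deviating path.
        path, node = [], goal
        while node != start:
            path.append(node)
            node = prev[node]
        path.append(start)
        return path[::-1]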

In some implementations, changing the velocity of the unmanned aerial vehicle may include changing one or more speeds of the unmanned aerial vehicle and one or more directions of the unmanned aerial vehicle. For example, the direction of UAV 1300 may be changed so that UAV 1300 moves in changed predicted UAV path 1380 (shown in FIG. 13E). Changed predicted UAV path 1380 may include a different overlap with predicted moving object path 1365 than predicted UAV path 1360. The overlap between changed predicted UAV path 1380 and predicted moving object path 1365 is shown as collision zone 1390. The speed of UAV 1300 may be changed so that UAV 1300 and moving object 1320 are not within collision zone 1390 (or the same part of collision zone 1390) at the same time.

Referring to FIG. 15, ego-motion system 1500 for estimating an ego-motion may include one or more of hardware-implemented processor 1511, stereo image sensor 1512, motion and orientation sensor 1513, electronic storage 1514, interface 1517, and/or other components. Pairs of stereo images may be obtained at different times using stereo image sensor 1512. A depth image may be determined based on a comparison of an earlier obtained pair of stereo images. A predicted motion of stereo image sensor 1512 may be obtained using motion and orientation sensor 1513. A prediction of a pair of stereo images at a later time may be determined using the depth image and the predicted motion of stereo image sensor 1512. Error between the pair of predicted stereo images and the later obtained pair of stereo images may be determined. An estimated actual motion of stereo image sensor 1512 may be obtained by adjusting the predicted motion to reduce the error.

Stereo image sensor 1512 may include first image sensor 1531, second image sensor 1532, and/or other components. First image sensor 1531 may be configured to generate first visual output signals. The first visual output signals may convey visual information within a field of view of first image sensor 1531. Visual information may include one or more of an image, a video, and/or other visual information. First image sensor 1531 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. Second image sensor 1532 may be configured to generate second visual output signals. The second visual output signals may convey visual information within a field of view of second image sensor 1532. Second image sensor 1532 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

Motion and orientation sensor 1513 may be configured to generate motion and orientation output signals. The motion and orientation output signals may convey motion information and orientation information of stereo image sensor 1512. Motion information may include information regarding one or more of speed of stereo image sensor 1512, distance traveled by stereo image sensor 1512, and/or movement of stereo image sensor 1512, including one or more of moving forward, moving backwards, moving right, moving left, moving up, moving down, other movements, and/or other motion information. Orientation information may include information regarding orientation of stereo image sensor 1512, including one or more of turning right, turning left, rolling right, rolling left, pitching up, pitching down, and/or other orientation information. Motion information and/or orientation information may be processed to obtain motions and/or orientations of stereo image sensor 1512 at particular times and/or locations. Motion and orientation sensor 1513 may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

Electronic storage 1514 may include electronic storage media that electronically stores information. Electronic storage 1514 may store software algorithms, information determined by hardware-implemented processor 1511, information received remotely, and/or other information that enables ego-motion system 1500 to function properly. For example, electronic storage 1514 may store depth information (as discussed elsewhere herein), and/or other information.

Hardware-implemented processor 1511 may be configured to provide information processing capabilities in ego-motion system 1500. Hardware-implemented processor 1511 may include one or more computing/processing devices with one or more algorithms/logics implemented in hardware to perform one or more functions. Algorithms/logics may be implemented in hardware via machine-readable instructions (e.g., a hardware description language file, a netlist, etc.). As a non-limiting example, hardware-implemented processor 1511 may include one or more of a field-programmable gate array, an application-specific integrated circuit, and/or other hardware-implemented processors. In some implementations, hardware-implemented processor 1511 may include a plurality of processing units. In some implementations, hardware-implemented processor 1511 may be coupled with one or more of RAM, ROM, input/output ports, and/or other peripherals.

Hardware-implemented processor 1511 may be configured by machine-readable instructions to execute one or more function components. The function components may include one or more of imaging component 1520, depth image component 1521, predicted motion component 1522, predicted imaging component 1523, error component 1524, actual motion component 1525, and/or other function components.

Imaging component 1520 may be configured to obtain one or more images and/or other information. One or more images may be determined based on one or more output signals conveying visual information. The images may be determined by one or more of imaging component 1520, stereo image sensor 1512, first image sensor 1531, second image sensor 1532, a computing/processing device coupled to stereo image sensor 1512/first image sensor 1531/second image sensor 1532, and/or other components.

Imaging component 1520 may be configured to obtain a first image, a second image, a third image, a fourth image, and/or other images at the same or different times. The first image may be determined based on the first visual output signals such that the first visual output signals used to determine the first image are generated by first image sensor 1531 at a first time. The second image may be determined based on the second visual output signals such that the second visual output signals used to determine the second image are generated by second image sensor 1532 at the first time.

The third image may be determined based on the first visual output signals such that the first visual output signals used to determine the third image are generated by first image sensor 1531 at a second time that is subsequent to the first time. The fourth image may be determined based on the second visual output signals such that the second visual output signals used to determine the fourth image are generated by second image sensor 1532 at the second time.

Imaging component 1520 may be configured to undistort and rectify one or more images. Imaging component 1520 may be configured to undistort and rectify the first image, the second image, and/or other images. Imaging component 1520 may be configured to undistort and rectify the third image, the fourth image, and/or other images. Images may be undistorted and rectified based on a relative position and orientation of first image sensor 1531 and second image sensor 1532.
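
The undistort-and-rectify step could be realized with OpenCV as sketched below; the disclosure does not name a library, and the calibration inputs (K1, D1, K2, D2, R, T) are placeholders for the actual sensor calibration.

    # A possible realization of the undistort-and-rectify step using OpenCV.

    import cv2

    def rectify_pair(left_img, right_img, K1, D1, K2, D2, R, T):
        """Undistort and rectify a stereo pair given the relative pose (R, T)
        between the first and second image sensors."""
        size = left_img.shape[1], left_img.shape[0]  # (width, height)
        R1, R2, P1, P2, Q, _, _ = cv2.stereoRectify(K1, D1, K2, D2, size, R, T)
        map1l, map2l = cv2.initUndistortRectifyMap(K1, D1, R1, P1, size, cv2.CV_32FC1)
        map1r, map2r = cv2.initUndistortRectifyMap(K2, D2, R2, P2, size, cv2.CV_32FC1)
        left_rect = cv2.remap(left_img, map1l, map2l, cv2.INTER_LINEAR)
        right_rect = cv2.remap(right_img, map1r, map2r, cv2.INTER_LINEAR)
        return left_rect, right_rect, Q  # Q reprojects disparity to 3-D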

For example, FIG. 16A illustrates a top-down view of an arrangement of stereo image sensor 1600, sphere 1620, and block 1625 at time t. FIG. 16B illustrates a top-down view of an arrangement of stereo image sensor 1600, sphere 1620, and block 1625 at time t+1 (subsequent to time t). Dashed lines in FIGS. 16A-16B extending out from left image sensor 1610 and right image sensor 1615 may indicate the middle of fields of view of left image sensor 1610 and right image sensor 1615. Between time t and time t+1, stereo image sensor 1600 may have moved to the right and rotated to the left, while sphere 1620 and block 1625 may have remained stationary.

Imaging component 1520 may obtain images shown in FIGS. 17A-17D. Left image at t 1710 (shown in FIG. 17A) may be determined based on visual output signals conveying visual information generated by left image sensor 1610 at time t. Right image at t 1715 (shown in FIG. 17B) may be determined based on visual output signals conveying visual information generated by right image sensor 1615 at time t. Left image at t+1 1720 (shown in FIG. 17C) may be determined based on visual output signals conveying visual information generated by left image sensor 1610 at time t+1. Right image at t+1 1725 (shown in FIG. 17D) may be determined based on visual output signals conveying visual information generated by right image sensor 1615 at time t+1. Dashed lines in FIGS. 17A-17D may indicate the middle of fields of view of left image sensor 1610 and right image sensor 1615.

Depth image component 1521 may be configured to determine one or more depth images based on one or more comparisons of images. For example, depth image component 1521 may determine a depth image based on a comparison of left image at t 1710 and right image at t 1715. A depth image may include pixel values characterizing distances between stereo image sensor 1600 and objects within the compared images (e.g., left image at t 1710 and right image at t 1715, etc.). For example, FIG. 18 illustrates an exemplary depth image at time t 1800. Depth image at time t 1800 may be determined by depth image component 1521 based on a comparison of left image at t 1710 and right image at t 1715. Depth image at time t 1800 may include pixel values characterizing distances between stereo image sensor 1600 and sphere 1620, and pixel values characterizing distances between stereo image sensor 1600 and block 1625.

Depth images may be determined at a rate different from an image acquisition rate of stereo image sensor 1600. For example, depth image component 1521 may determine depth images when the state-estimation framework generates a new keyframe. In some implementations, ego-motion system 1500 may use the state-estimation framework described in U.S. Provisional Patent Application No. 62/203,765, entitled “SYSTEM AND METHOD FOR VISUAL INSPECTION USING AERIAL ROBOTS,” filed on Aug. 11, 2015, the foregoing being incorporated herein by reference in its entirety. A new keyframe may be generated when motion of stereo image sensor 1600 exceeds a certain threshold (e.g., threshold translational motion, etc.). Such determination of depth images may allow for consistent mapping of the environment while avoiding continuous re-estimation of depth of previously seen scenes.

In some implementations, one or more depth images may be determined using one or more parallel algorithms and/or other algorithms. A parallel algorithm may assess depth on a per-pixel level, and individual threads may run per pixel. In some implementations, one or more depth images may be determined using one or more block matching algorithms and/or other algorithms. In some implementations, one or more depth images may be determined using one or more semi-global block matching algorithms and/or other algorithms.
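
A minimal sketch of the semi-global block matching option using OpenCV's StereoSGBM is given below. The matcher parameters and the focal-length/baseline conversion to metric depth are illustrative assumptions, not values from the disclosure.

    # Hedged sketch of semi-global block matching with OpenCV's StereoSGBM;
    # parameter values are typical defaults chosen for illustration.

    import cv2
    import numpy as np

    def depth_from_stereo(left_rect, right_rect, focal_px, baseline_m):
        matcher = cv2.StereoSGBM_create(
            minDisparity=0,
            numDisparities=64,   # must be divisible by 16
            blockSize=7,
        )
        # Disparity is returned in fixed point, scaled by 16.
        disparity = matcher.compute(left_rect, right_rect).astype(np.float32) / 16.0
        # Pixel values characterizing distance: depth = f * B / disparity.
        with np.errstate(divide="ignore"):
            depth = np.where(disparity > 0, focal_px * baseline_m / disparity, 0.0)
        return depth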

Predicted motion component 1522 may be configured to obtain one or more predicted motions of stereo image sensor 1512 between different times. Predicted motion component 1522 may estimate the motion of stereo image sensor 1512 between different times. For example, predicted motion component 1522 may be configured to obtain a predicted motion of stereo image sensor 1512 between time t and time t+1. The predicted motion of stereo image sensor 1512 between time t and time t+1 may be used to determine a predicted pose of stereo image sensor 1512 at time t+1. For example, FIG. 16C illustrates an exemplary predicted pose of stereo image sensor 1600. Based on the predicted motion of stereo image sensor 1600 between time t and time t+1, predicted stereo image sensor pose 1630 may be determined. Predicted stereo image sensor pose 1630 may be determined by moving the pose of stereo image sensor 1600 at time t (as shown in FIG. 16A) to account for the predicted motion. The predicted pose may be offset from the actual pose. For example, in FIG. 16C, predicted stereo image sensor pose 1630 may be located below and to the left (as seen from a top-down view) of the actual pose of stereo image sensor 1600.

The predicted motion of stereo image sensor 1512 may be determined based on the motion and orientation output signals generated by motion and orientation sensor 1513. The motion and orientation output signals may convey motion information and orientation information of stereo image sensor 1512. In some implementations, the predicted motion may be determined based on visual odometry. In some implementations, predicted motion may be determined by a companion processor (e.g., a software-implemented processor, etc.), and/or other components. A companion processor may be located local to hardware-implemented processor 1511 or located remotely from hardware-implemented processor 1511. Predicted motion component 1522 may obtain predicted motion from the companion processor.
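
One way the motion and orientation output signals could be integrated into a predicted pose is sketched below. The first-order integration, the small-angle rotation update, and the assumption that gravity has already been removed from the accelerometer signal are simplifications, not the claimed method.

    # Minimal sketch of predicting a pose from motion and orientation output
    # signals: gyro rates are integrated into orientation and accelerations
    # into position over the interval t -> t+1. Bias and noise are ignored.

    import numpy as np

    def predict_pose(position, velocity, rotation, gyro, accel, dt):
        """All quantities in the world frame; `rotation` is a 3x3 matrix."""
        # First-order integration of angular rate (small-angle approximation).
        wx, wy, wz = gyro * dt
        skew = np.array([[0, -wz, wy], [wz, 0, -wx], [-wy, wx, 0]])
        rotation_new = rotation @ (np.eye(3) + skew)
        # Integrate specific force (gravity assumed already removed) to update
        # velocity and position of the stereo image sensor.
        accel_world = rotation @ accel
        velocity_new = velocity + accel_world * dt
        position_new = position + velocity * dt + 0.5 * accel_world * dt ** 2
        return position_new, velocity_new, rotation_new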

Predicted imaging component 1523 may be configured to determine one or more predicted images. Predicted images may estimate how a scene may appear from one or more image sensors based on motion of the image sensors between different times (e.g., estimated views of a scene from a predicted pose, etc.). For example, one or more predicted images may be determined based on one or more predicted motions of stereo image sensor 1512 between time t and time t+1, one or more depth images, and/or other information. For example, predicted imaging component 1523 may determine predicted left image at t+1 1920 (shown in FIG. 19A) by adjusting left image at t 1710 based on the predicted motion, depth image at t 1800, and/or other information. Predicted left image at t+1 1920 may show a scene of sphere 1620 and block 1625 as would be observed by left image sensor 1610 from predicted stereo image sensor pose 1630. Predicted imaging component 1523 may determine predicted right image at t+1 1925 (shown in FIG. 19B) by adjusting right image at t 1715 based on the predicted motion, depth image 1800, and/or other information. Predicted right image at t+1 1925 may show a scene of sphere 1620 and block 1625 as would be observed by right image sensor 1615 from predicted stereo image sensor pose 1630.
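
A minimal sketch of determining a predicted image follows: each pixel of the image at time t is back-projected using the depth image, moved by the predicted motion (R, t), and re-projected into the predicted view. Pinhole intrinsics K, a grayscale image, and the absence of occlusion handling are assumptions made for brevity.

    # Hedged sketch of forward-warping an image to a predicted pose using a
    # depth image and predicted motion (R, t). Grayscale images assumed.

    import numpy as np

    def predict_image(image_t, depth_t, K, R, t):
        h, w = image_t.shape[:2]
        K_inv = np.linalg.inv(K)
        predicted = np.zeros_like(image_t)
        vs, us = np.mgrid[0:h, 0:w]
        pix = np.stack([us.ravel(), vs.ravel(), np.ones(h * w)])  # 3 x N homogeneous
        rays = K_inv @ pix                                        # back-projected rays
        pts = rays * depth_t.ravel()                              # 3-D points at time t
        pts_new = R @ pts + t.reshape(3, 1)                       # apply predicted motion
        proj = K @ pts_new
        u2 = (proj[0] / proj[2]).round().astype(int)
        v2 = (proj[1] / proj[2]).round().astype(int)
        ok = (depth_t.ravel() > 0) & (proj[2] > 0) & \
             (u2 >= 0) & (u2 < w) & (v2 >= 0) & (v2 < h)
        predicted[v2[ok], u2[ok]] = image_t.ravel()[ok]
        return predicted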

Error component 1524 may be configured to determine one or more error distributions of one or more predicted motions by comparing one or more captured images with one or more predicted images. Error component 1524 may determine one or more differences between captured images determined at a certain time and predicted images determined for the certain time. For example, error component 1524 may determine one or more differences between images captured by stereo image sensor 1600 at time t+1 and images predicted for time t+1 by predicted imaging component 1523. For example, error component 1524 may determine an error distribution of the predicted motion of stereo image sensor 1512 between time t and time t+1 by comparing left image at t+1 1720 with predicted left image at t+1 1920 and right image at t+1 1725 with predicted right image at t+1 1925.

Actual motion component 1525 may be configured to obtain one or more estimated actual motions of stereo image sensor 1512 between different times. For example, actual motion component 1525 may be configured to obtain an estimated actual motion of stereo image sensor 1512 between time t and time t+1. The estimated actual motion may be determined by adjusting the predicted motion to reduce the error distribution of the predicted motion. For example, for individual pixels, error may be computed between the predicted and measured intensities. The predicted motion may be adjusted to minimize the intensity errors between a predicted image (e.g., predicted left image at t+1 1920, etc.) and a captured image (e.g., left image at t+1 1720, etc.). Minimizing the intensity errors may shift predicted stereo image sensor pose 1630 closer to the actual pose of stereo image sensor 1600.
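
The adjustment of the predicted motion to reduce intensity error could look like the sketch below, which brute-forces small translation offsets and keeps the one minimizing mean per-pixel intensity error. A production system would use an analytic update (e.g., Gauss-Newton); the `render` callback and the step size are hypothetical.

    # Hedged sketch: adjust the predicted translation to reduce photometric
    # error between a predicted image and the captured image at t+1.

    import numpy as np

    def refine_translation(render, image_t1, K, R, t, step=0.01):
        """`render(K, R, t)` returns a predicted image for candidate motion (R, t)."""
        best_t, best_err = t, np.inf
        for dx in (-step, 0.0, step):
            for dy in (-step, 0.0, step):
                for dz in (-step, 0.0, step):
                    cand = t + np.array([dx, dy, dz])
                    pred = render(K, R, cand)
                    # Per-pixel intensity error between prediction and capture.
                    err = np.mean((pred.astype(np.float32) -
                                   image_t1.astype(np.float32)) ** 2)
                    if err < best_err:
                        best_t, best_err = cand, err
        return best_t, best_err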

In some implementations, the predicted motion may be adjusted based on one or more absolute error thresholds and/or other information. For example, the predicted motion may be adjusted until the error distribution (or a majority/certain percentage of the error distribution) of the predicted motion is below a certain error threshold. In some implementations, the predicted motion may be adjusted based on one or more relative error thresholds and/or other information. For example, the predicted motion may be adjusted until the change in the error distribution (or the change in a majority/certain percentage of the error distribution) is below a certain error threshold.

In some implementations, the adjustment of the predicted motion may be performed over a fixed number of image key frames. Image key frames may be arbitrarily spaced in time and may be related to each other by error terms. Such an approach may avoid pose drift during local motions (such as hovering by an unmanned aerial vehicle carrying image sensors) while tracking dynamic motions.

In some implementations, the estimated actual motion of stereo image sensor 1512 between time t and time t+1 may include an estimated position of stereo image sensor 1512 at time t+1, an estimated orientation of stereo image sensor 1512 at time t+1, an estimated velocity of stereo image sensor 1512 at time t+1, and/or other estimated actual motion of stereo image sensor 1512.

In some implementations, the estimated actual motion of stereo image sensor 1512 between time t and time t+1 may be used to estimate one or more of an estimated gravity-aligned position of stereo image sensor 1512 at time t+1, an estimated gravity-aligned orientation of stereo image sensor 1512 at time t+1, an estimated gravity-aligned velocity of stereo image sensor 1512 at time t+1, and/or other estimated gravity-aligned motion of stereo image sensor 1512. Estimated gravity-aligned motion of stereo image sensor 1512 may be estimated using an Extended Kalman filter. The Extended Kalman filter may run on a soft-core of hardware-implemented processor 1511 or a companion processor (e.g., a software-implemented processor, etc.).

A depth image may be transformed into a three-dimensional point-cloud. A statistical voxel map, an Octomap, a spherical depth map, and/or other depth map framework may be used. In some implementations, a depth image may be converted into a three-dimensional point-cloud as described in U.S. Provisional Patent Application No. 62/203,765, entitled “SYSTEM AND METHOD FOR VISUAL INSPECTION USING AERIAL ROBOTS,” filed on Aug. 11, 2015, incorporated supra.
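
A minimal sketch of transforming a depth image into a three-dimensional point cloud is shown below, assuming pinhole intrinsics (fx, fy, cx, cy); the provisional application cited above describes the conversion actually contemplated.

    # Hedged sketch: depth image to 3-D point cloud under assumed pinhole
    # intrinsics. Pixels with no depth estimate are skipped.

    import numpy as np

    def depth_to_point_cloud(depth, fx, fy, cx, cy):
        h, w = depth.shape
        vs, us = np.mgrid[0:h, 0:w]
        z = depth.ravel()
        valid = z > 0                      # skip pixels with no depth estimate
        x = (us.ravel() - cx) * z / fx
        y = (vs.ravel() - cy) * z / fy
        return np.stack([x, y, z], axis=1)[valid]  # N x 3 points, camera frame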

Based on the estimated actual motion of stereo image sensor 1512, obstacle avoidance and/or trajectory planning may be provided for stereo image sensor 1512 (and/or objects carrying stereo image sensor 1512, such as an unmanned aerial vehicle). In some implementations, obstacle avoidance and/or trajectory planning may be provided using a potential field algorithm, algorithms discussed herein, and/or other algorithms. In some implementations, obstacle avoidance and/or trajectory planning may be provided as described in U.S. Provisional Patent Application No. 62/203,765, entitled “SYSTEM AND METHOD FOR VISUAL INSPECTION USING AERIAL ROBOTS,” filed on Aug. 11, 2015, incorporated supra.

Referring to FIG. 21, depth refining system 2100 for refining a depth image may include one or more of processor 2111, stereo image sensor 2112, electronic storage 2114, interface 2117, and/or other components. In some implementations, the system may include motion and orientation sensor 2113. Images may be obtained at different times using stereo image sensor 2112. A depth image may be determined based on a comparison of an earlier obtained pair of stereo images. The depth image may be used to predict one of the earlier obtained pair of stereo images. The depth image may be refined based on a comparison of the earlier obtained stereo image and the prediction of the stereo image at the earlier time. A predicted motion of stereo image sensor 2112 may be determined. A prediction of a stereo image at a later time may be determined using the depth image and the predicted motion of stereo image sensor 2112. The depth image may be refined based on a comparison of the later obtained stereo image and the prediction of the stereo image at the later time.

Stereo image sensor 2112 may include first image sensor 2131, second image sensor 2132, and/or other components. First image sensor 2131 may be configured to generate first visual output signals. The first visual output signals may convey visual information within a field of view of first image sensor 2131. Visual information may include one or more of an image, a video, and/or other visual information. First image sensor 2131 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors. Second image sensor 2132 may be configured to generate second visual output signals. The second visual output signals may convey visual information within a field of view of second image sensor 2132. Second image sensor 2132 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or other image sensors.

Motion and orientation sensor 2113 may be configured to generate motion and orientation output signals. The motion and orientation output signals may convey motion information and orientation information of stereo image sensor 2112. Motion information may include information regarding one or more of speed of stereo image sensor 2112, distance traveled by stereo image sensor 2112, and/or movement of stereo image sensor 2112, including one or more of moving forward, moving backwards, moving right, moving left, moving up, moving down, other movements, and/or other motion information. Orientation information may include information regarding orientation of stereo image sensor 2112, including one or more of turning right, turning left, rolling right, rolling left, pitching up, pitching down, and/or other orientation information. Motion information and/or orientation information may be processed to obtain motions and/or orientations of stereo image sensor 2112 at particular times and/or locations. Motion and orientation sensor 2113 may include one or more of an accelerometer, a tilt sensor, an inclination sensor, an angular rate sensor, a gyroscope, an inertial measurement unit, a compass, a magnetometer, a pressure sensor, a barometer, a global positioning system device, a distance sensor, and/or other motion and orientation sensors.

Electronic storage 2114 may include electronic storage media that electronically stores information. Electronic storage 2114 may store software algorithms, information determined by processor 2111, information received remotely, and/or other information that enables depth refining system 2100 to function properly. For example, electronic storage 2114 may store information relating to depth images (as discussed elsewhere herein), and/or other information.

Processor 2111 may be configured to provide information processing capabilities in depth refining system 2100. As such, processor 2111 may comprise one or more of a digital processor, an analog processor, a digital circuit designed to process information, a central processing unit, a graphics processing unit, a microcontroller, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information. By way of non-limiting example, a microcontroller may include one or more of 8051, PIC, AVR, ARM microcontrollers, and/or other microcontrollers. In some implementations, processor 2111 may include a plurality of processing units. In some implementations, processor 2111 may be coupled with one or more of RAM, ROM, input/output ports, and/or other peripherals.

Processor 2111 may be configured by machine-readable instructions to execute one or more computer program components. The computer program components may include one or more of imaging component 2120, depth image component 2121, predicted motion component 2122, predicted imaging component 2123, refine component 2124, and/or other computer program components. In some implementations, processor 2111 may include one or more of a hardware-implemented processor, a software-implemented processor, and/or other processors. In some implementations, one or more hardware-implemented processors may be located remotely from one or more software-implemented processors.

Imaging component 2120 may be configured to obtain one or more images and/or other information. One or more images may be determined based on one or more output signals conveying visual information. The images may be determined by one or more of imaging component 2120, stereo image sensor 2112, first image sensor 2131, second image sensor 2132, a computing/processing device coupled to stereo image sensor 2112/first image sensor 2131/second image sensor 2132, and/or other components.

Imaging component 2120 may be configured to obtain a first image, a second image, a third image, a fourth image, and/or other images. The first image may be determined based on the first visual output signals such that the first visual output signals used to determine the first image are generated by first image sensor 2131 at a first time. The second image may be determined based on the second visual output signals such that the second visual output signals used to determine the second image are generated by second image sensor 2132 at the first time.

The third image may be determined based on the first visual output signals such that the first visual output signals used to determine the third image are generated by first image sensor 2131 at a second time that is subsequent to the first time. The fourth image may be determined based on the second visual output signals such that the second visual output signals used to determine the fourth image are generated by second image sensor 2132 at the second time.

Imaging component 2120 may be configured to undistort and rectify one or more images. Imaging component 2120 may be configured to undistort and rectify the first image, the second image, and/or other images. Imaging component 2120 may be configured to undistort and rectify the third image, the fourth image, and/or other images. Images may be undistorted and rectified based on a relative position and orientation of first image sensor 2131 and second image sensor 2132.

For example, first image sensor 2131 and second image sensor 2132 may be used to obtain left image at t 1710, right image at t 1715, left image at t+1 1720, and right image at t+1 1725 (shown in FIGS. 17A-17D) based on poses of stereo image sensor 2112 at time t and at time t+1. Poses of stereo image sensor 2112 at time t and time t+1 may be the same or similar to the poses of stereo image sensor 1600 shown in FIGS. 16A-16B.

Depth image component 2121 may be configured to determine one or more depth images based on one or more comparisons of images. For example, depth image component 2121 may determine depth image at t 2200 (shown in FIG. 22A) based on a comparison of left image at t 1710 and right image at t 1715. Depth image at t 2200 may include pixel values characterizing distances between stereo image sensor 2112 and objects within the compared images (e.g., left image at t 1710 and right image at t 1715, etc.). In some implementations, depth image component 2121 may determine depth image at t+1 2205 (shown in FIG. 22B) based on a comparison of left image at t+1 1720 and right image at t+1 1725. Depth image at t+1 2205 may include pixel values characterizing distances between stereo image sensor 2112 and objects within the compared images.

Predicted motion component 2122 may be configured to obtain one or more predicted motions of stereo image sensor 2112 between different times. Predicted motion component 2122 may estimate the motion of stereo image sensor 2112 between different times. For example, predicted motion component 2122 may be configured to obtain a predicted motion of stereo image sensor 2112 between time t and time t+1. Predicted motion of stereo image sensor 2112 between time t and time t+1 may be used to determine a predicted pose of stereo image sensor 2112 at time t+1. In some implementations, the predicted motion of stereo image sensor 2112 may be determined based on the motion and orientation output signals generated by motion and orientation sensor 2113. The motion and orientation output signals may convey motion information and orientation information of stereo image sensor 2112. In some implementations, the predicted motion may be determined based on visual odometry. In some implementations, the predicted motion may include the estimated actual motion described above.

Predicted imaging component 2123 may be configured to determine one or more predicted images. Predicted images may estimate how a scene may appear from one or more image sensors from a particular/predicted pose. For example, one or more predicted images may be determined based on one or more of the predicted motion of stereo image sensor 2112, depth images, and/or other information. For example, predicted imaging component 2123 may determine predicted right image at t 2320 (shown in FIG. 23A) by adjusting left image at t 1710 based on depth image at t 2200, and/or other information. Given left image at t 1710 and depth image at t 2200, predicted imaging component 2123 may predict how the scene may appear to second image sensor 2132 at time t.

Predicted imaging component 2123 may determine predicted left image at t+1 2325 by adjusting left image at t 1710 based on the predicted motion, depth image at t 2200, and/or other information. Given left image at t 1710, depth image at t 2200, and the predicted motion, predicted imaging component 2123 may predict how the scene may appear to first image sensor 2131 at time t+1. In some implementations, predicted left image at t+1 2325 may be determined using an inverse compositional warping approach as described in U.S. Provisional Patent Application No. 62/203,745, entitled “SYSTEM AND METHOD FOR MOBILE ROBOT NAVIGATION,” filed on Aug. 11, 2015, the foregoing being incorporated herein by reference in its entirety.

In some implementations, predicted imaging component 2123 may determine predicted depth image at t+1 2400 (shown in FIG. 24) by adjusting depth image at t 2200 based on the predicted motion and/or other information. Given depth image at t 2200 and the predicted motion, predicted imaging component 2123 may predict how the depth image may appear at time t+1.

Refine component 2124 may be configured to refine one or more depth images based on one or more comparisons of images with predicted images. Refine component 2124 may keep depth values of depth images that generate a good prediction of images and remove depth values of depth images that do not generate a good prediction of images. For example, refine component 2124 may be configured to refine depth image at t 2200 based on a comparison of right image at t 2310 (obtained using second image sensor 2132) and predicted right image at t 2320 (obtained by adjusting left image at t 1710 using depth image at t 2200). In some implementations, refining depth image at t 2200 based on the comparison of right image at t 2310 with predicted right image at t 2320 may include determining one or more mismatches between an image patch in right image at t 2310 and a corresponding image patch in predicted right image at t 2320. Responsive to the mismatch(es) meeting and/or exceeding an error threshold, one or more pixel values of depth image at t 2200 corresponding to the image patch may be rejected.

Refine component 2124 may be configured to refine depth image at t 2200 based on a comparison of left image at t+1 2315 (obtained using first image sensor 2131) with predicted left image at t+1 2325 (obtained by adjusting left image at t 1710 using depth image at t 2200 and the predicted motion). In some implementations, refining depth image at t 2200 based on the comparison of left image at t+1 2315 with predicted left image at t+1 2325 may include determining one or more mismatches between an image patch in left image at t+1 2315 and a corresponding image patch in predicted left image at t+1 2325. Responsive to the mismatch(es) meeting and/or exceeding an error threshold, one or more pixel values of depth image at t 2200 corresponding to the image patch may be rejected.
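
The patch-comparison refinement described above might be sketched as follows. The patch size and base error threshold are illustrative assumptions; as noted below, the threshold may instead scale with the inverse of distance.

    # Hedged sketch of patch-comparison depth refinement: depth values whose
    # patches predict the other view poorly are rejected (set to 0 here).

    import numpy as np

    def refine_depth(depth, captured, predicted, patch=8, base_threshold=100.0):
        refined = depth.copy()
        h, w = captured.shape
        for v in range(0, h - patch + 1, patch):
            for u in range(0, w - patch + 1, patch):
                cap = captured[v:v + patch, u:u + patch].astype(np.float32)
                pre = predicted[v:v + patch, u:u + patch].astype(np.float32)
                mismatch = np.mean((cap - pre) ** 2)
                if mismatch >= base_threshold:
                    # Reject depth values that did not generate a good prediction.
                    refined[v:v + patch, u:u + patch] = 0.0
        return refined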

In some implementations, refine component 2124 may be configured to refine depth image at t+1 2205 based on a comparison of depth image at t+1 2205 with predicted depth image at t+1 2400. In some implementations, refining depth image at t+1 2205 based on the comparison of depth image at t+1 2205 with predicted depth image at t+1 2400 may include determining one or more mismatches between a pixel value in depth image at t+1 2205 and a corresponding pixel value in predicted depth image at t+1 2400. Responsive to the mismatch(es) meeting and/or exceeding an error threshold, a pixel value of depth image at t+1 2205 corresponding to the mismatch may be rejected. In some implementations, responsive to the mismatch(es) meeting and/or exceeding an error threshold, refine component 2124 may refine depth image at t 2200.

In some implementations, the error threshold may be based on the inverse of the distances between stereo image sensor 2112 and the objects. In some implementations, the mismatch between an image patch in an image and the corresponding image patch in a predicted image may be determined based on an error distribution. In some implementations, an image may be compared with a predicted image as described in U.S. Provisional Patent Application No. 62/203,745, entitled “SYSTEM AND METHOD FOR MOBILE ROBOT NAVIGATION,” filed on Aug. 11, 2015, incorporated supra.

While the present disclosure may be directed to unmanned aerial vehicles, one or more other implementations of the system may be configured for other types of vehicles. Other types of vehicles may include a passenger vehicle (e.g., a car, a bike, a boat, an airplane, etc.), a non-passenger vehicle, and/or a remote controlled vehicle (e.g., a remote controlled airplane, a remote controlled car, a remote controlled submarine, etc.).

Although all components of vehicle 10 are shown to be located in vehicle 10 in FIG. 1, some or all of the components may be installed in vehicle 10 and/or be otherwise coupled with vehicle 10. In some implementations, hardware-implemented processor 11 may be located remotely from vehicle 10. In some implementations, hardware-implemented processor 11 may be located on or in vehicle 10. In some implementations, image sensor 12 may be located remotely from vehicle 10. In some implementations, image sensor 12 may be located on or in vehicle 10.

Although components are shown to be connected to interface (17, 1217, 1517, 2117) in FIGS. 1, 12, 15, and 21, any communication medium may be used to facilitate interaction between any components depicted in FIGS. 1, 12, 15, and 21. One or more components may communicate with each other through hard-wired communication, wireless communication, or both. By way of non-limiting example, wireless communication may include one or more of radio communication, Bluetooth communication, Wi-Fi communication, cellular communication, infrared communication, or other wireless communication.

Although hardware-implemented processor (11, 1511) and processor (1211, 2111) are shown in FIGS. 1, 12, 15, and 21 as a single entity, this is for illustrative purposes only. In some implementations, the hardware-implemented processor/processor may comprise a plurality of processing units. These processing units may be physically located within the same device, or the hardware-implemented processor/processor may represent processing functionality of a plurality of devices operating in coordination.

It should be appreciated that although various components (computer program components, function components) are illustrated in FIGS. 1, 12, 15, and 21 as being co-located within a single processing unit, in implementations in which the hardware-implemented processor/processor comprises multiple processing units, one or more of the components may be located remotely from the other components.

The description of the functionality provided by the different computer program/function components described herein is for illustrative purposes, and is not intended to be limiting, as any of the computer program/function components may provide more or less functionality than is described. For example, one or more of the computer program/function components may be eliminated, and some or all of its functionality may be provided by other computer program/function components. As another example, one or more additional computer program/function components may perform some or all of the functionality attributed to one or more of the computer program/function components described herein.

Although image sensor 12, stereo image sensor (812, 1212, 1512, 2112), first image sensor (831, 1231, 1531, 2131), and second image sensor (832, 1232, 1532, 2132) are depicted in FIGS. 1, 12, 15, and 21 as a single element, this is not intended to be limiting. Image sensor/stereo image sensor may include one or more image sensors in one or more locations.

Although motion and orientation sensors (13, 1213, 1513, 2113) are depicted in FIGS. 1, 12, 15, and 21 as a single element, this is not intended to be limiting. Motion and orientation sensor may include one or more motion and orientation sensors in one or more locations.

The electronic storage media of electronic storage (14, 1214, 1514, 2114) may be provided integrally (i.e., substantially non-removable) with one or more components and/or removable storage that is connectable to one or more components via, for example, a port (e.g., a USB port, a Firewire port, etc.) or a drive (e.g., a disk drive, etc.). Electronic storage (14, 1214, 1514, 2114) may include one or more of optically readable storage media (e.g., optical disks, etc.), magnetically readable storage media (e.g., magnetic tape, magnetic hard drive, floppy drive, etc.), electrical charge-based storage media (e.g., EPROM, EEPROM, RAM, etc.), solid-state storage media (e.g., flash drive, etc.), and/or other electronically readable storage media. Electronic storage (14, 1214, 1514, 2114) may be a separate component or may be provided integrally with one or more other components shown in FIGS. 1, 12, 15, and 21 (e.g., hardware-implemented processor 11). Although electronic storage (14, 1214, 1514, 2114) is shown in FIGS. 1, 12, 15, and 21 as a single entity, this is for illustrative purposes only. In some implementations, electronic storage (14, 1214, 1514, 2114) may comprise a plurality of storage units. These storage units may be physically located within the same device, or electronic storage (14, 1214, 1514, 2114) may represent storage functionality of a plurality of devices operating in coordination.

FIG. 7 illustrates method 700 for detecting a moving object using adjusted optical flow. The operations of method 700 presented below are intended to be illustrative. In some implementations, method 700 may be accomplished with one or more additional operations not described, and/or without one or more of the operations discussed. In some implementations, two or more of the operations may occur substantially simultaneously.

Referring to FIG. 7 and method 700, at operation 701, visual output signals conveying visual information within a field of view of an image sensor may be generated. Visual information may include one or more of an image, a video, and/or other visual information. In some implementations, operation 701 may be performed by one or more sensors the same as or similar to image sensor 12 (shown in FIG. 1 and described herein).

At operation 702, motion and orientation output signals conveying motion information and orientation information of the image sensor may be generated. Motion information may include information regarding one or more of speed of the image sensor, distance traveled by the image sensor, and/or movement of the image sensor, including one or more of moving forward, moving backwards, moving right, moving left, moving up, moving down, and/or other movement, and/or other motion information. Orientation information may include information regarding orientation of the image sensor, including one or more of turning right, turning left, rolling right, rolling left, pitching up, pitching down, and/or other orientation information. In some implementations, operation 702 may be performed by one or more sensors the same as or similar to motion and orientation sensor 13 (shown in FIG. 1 and described herein).

At operation 703, a first image based on the visual output signals may be obtained. The visual output signals used to determine the first image may be generated by the image sensor at a first time. In some implementations, operation 703 may be performed by a processor component the same as or similar to imaging component 20 (shown in FIG. 1 and described herein).

At operation 704, a second image based on the visual output signals may be obtained. The visual output signals used to determine the second image may be generated by the image sensor at a second time. The second time may be subsequent to the first time. In some implementations, operation 704 may be performed by a processor component the same as or similar to imaging component 20 (shown in FIG. 1 and described herein).

At operation 705, a predicted change in the field of view of the image sensor between the first time and the second time may be obtained. In some implementations, operation 705 may be performed by a processor component the same as or similar to predicted change component 21 (shown in FIG. 1 and described herein).

At operation 706, a predicted first image may be determined by adjusting the second image based on the predicted change in the field of view of the image sensor. In some implementations, operation 706 may be performed by a processor component the same as or similar to predicted imaging component 22 (shown in FIG. 1 and described herein).

At operation 707, an actual change in the field of view of the image sensor between the first time and the second time may be obtained. The actual change in the field of view of the image sensor between the first time and the second time may be determined based on a comparison of the first image with the predicted first image. In some implementations, operation 707 may be performed by a processor component the same as or similar to actual change component 23 (shown in FIG. 1 and described herein).

At operation 708, optical flow between the first image and the second image may be obtained. In some implementations, operation 708 may be performed by a processor component the same as or similar to optical flow component 24 (shown in FIG. 1 and described herein).

At operation 709, an adjusted optical flow may be obtained. The adjusted optical flow may be determined by adjusting the optical flow based on the actual change in the field of view of the image sensor. In some implementations, operation 709 may be performed by a processor component the same as or similar to optical flow adjustment component 25 (shown in FIG. 1 and described herein).

At operation 710, the presence of the moving object may be obtained. The presence of the moving object may be detected based on the adjusted optical flow. In some implementations, operation 710 may be performed by a processor component the same as or similar to detection component 26 (shown in FIG. 1 and described herein).

FIG. 11 illustrates method 1100 for collision avoidance of an unmanned aerial vehicle. The operations of method 1100 presented below are intended to be illustrative. In some implementations, method 1100 may be accomplished with one or more additional operations not described, and/or without one or more of the operations discussed. In some implementations, two or more of the operations may occur substantially simultaneously.

Referring to FIG. 11 and method 1100, at operation 1101, depth information for an environment around an unmanned aerial vehicle may be obtained. The depth information may characterize one or more distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle. In some implementations, operation 1101 may be performed by a processor component the same as or similar to depth information component 820 (shown in FIG. 8 and described herein).

At operation 1102, a spherical depth map may be generated from the depth information. The spherical depth map may represent distances to the closest surfaces of the environment around the unmanned aerial vehicle as a function of longitude angles and latitude angles. In some implementations, operation 1102 may be performed by a processor component the same as or similar to spherical depth map component 821 (shown in FIG. 8 and described herein).
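
A minimal sketch of the binning behind operation 1102 follows: three-dimensional points around the vehicle are grouped by longitude and latitude angle, keeping the distance to the closest surface per angular bin. The bin counts are illustrative assumptions.

    # Hedged sketch: generating a spherical depth map by binning 3-D points
    # by longitude and latitude angles, keeping the closest surface per bin.

    import numpy as np

    def spherical_depth_map(points, n_lon=64, n_lat=32):
        """points: N x 3 array in the vehicle frame; returns an n_lat x n_lon map."""
        x, y, z = points[:, 0], points[:, 1], points[:, 2]
        r = np.linalg.norm(points, axis=1)
        lon = np.arctan2(y, x)                                           # [-pi, pi]
        lat = np.arcsin(np.clip(z / np.maximum(r, 1e-9), -1.0, 1.0))     # [-pi/2, pi/2]
        i = ((lat + np.pi / 2) / np.pi * (n_lat - 1)).astype(int)
        j = ((lon + np.pi) / (2 * np.pi) * (n_lon - 1)).astype(int)
        depth_map = np.full((n_lat, n_lon), np.inf)
        for bi, bj, br in zip(i, j, r):
            # Keep the distance to the closest surface in each angular bin.
            depth_map[bi, bj] = min(depth_map[bi, bj], br)
        return depth_map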

At operation 1103, maneuver controls for the unmanned aerial vehicle may be provided based on the spherical depth map. The maneuver controls for the unmanned aerial vehicle may include controlling the unmanned aerial vehicle to avoid one or more objects in the environment around the unmanned aerial vehicle. In some implementations, operation 1103 may be performed by a processor component the same as or similar to maneuver controls component 822 (shown in FIG. 8 and described herein).

FIG. 14 illustrates method 1400 for collision avoidance of an unmanned aerial vehicle. The operations of method 1400 presented below are intended to be illustrative. In some implementations, method 1400 may be accomplished with one or more additional operations not described, and/or without one or more of the operations discussed. In some implementations, two or more of the operations may occur substantially simultaneously.

Referring to FIG. 14 and method 1400, at operation 1401, objects in an environment around an unmanned aerial vehicle may be detected. The objects may include one or more stationary objects and/or one or more moving objects. In some implementations, operation 1401 may be performed by a processor component the same as or similar to object detection component 1220.

At operation 1402, depth information for the environment around the unmanned aerial vehicle may be obtained. The depth information may characterize one or more distances between the unmanned aerial vehicle and the environment around the unmanned aerial vehicle.

The environment around the unmanned aerial vehicle may include one or more detected stationary objects and/or one or more detected moving objects. In some implementations, operation 1402 may be performed by a processor component the same as or similar to depth information component 1221 (shown in FIG. 12 and described herein).

At operation 1403, a depth map may be generated from the depth information. The depth map may characterize distances between the unmanned aerial vehicle and one or more detected stationary objects and/or one or more detected moving objects. The accuracy of the depth map may be characterized by a depth map accuracy and/or other information. In some implementations, operation 1403 may be performed by a processor component the same as or similar to depth map component 1222 (shown in FIG. 12 and described herein).

At operation 1404, vehicle physical model information may be obtained. The vehicle physical model information may characterize the motion of the unmanned aerial vehicle. The accuracy of the vehicle physical model information may be characterized by a vehicle physical model accuracy and/or other information. In some implementations, operation 1404 may be performed by a processor component the same as or similar to physical model information component 1223 (shown in FIG. 12 and described herein).

At operation 1405, a predicted vehicle path may be determined. The predicted vehicle path may include predicted location(s) of the unmanned aerial vehicle at one or more future times. In some implementations, operation 1405 may be performed by a processor component the same as or similar to predicted path component 1224 (shown in FIG. 12 and described herein).

At operation 1406, responsive to detecting a stationary object, it may be determined whether the unmanned aerial vehicle moving in the predicted vehicle path will collide with the detected stationary object. One or more collisions may be determined at one or more locations and/or at one or more future times. In some implementations, operation 1406 may be performed by a processor component the same as or similar to collision detection component 1225 (shown in FIG. 12 and described herein).

At operation 1407, responsive to detecting a moving object, moving object physical model information may be obtained. The moving object physical model information may characterize the motion of one or more moving objects. The accuracy of the moving object physical model information may be characterized by a moving object physical model accuracy and/or other information. In some implementations, operation 1407 may be performed by a processor component the same as or similar to physical model information component 1223 (shown in FIG. 12 and described herein).

At operation 1408, responsive to detecting the moving object, a predicted moving object path may be determined. The predicted moving object path may include predicted location(s) of the moving object at one or more future times. In some implementations, operation 1408 may be performed by a processor component the same as or similar to predicted path component 1224 (shown in FIG. 12 and described herein).

At operation 1409, responsive to detecting the moving object, it may be determined whether the unmanned aerial vehicle moving in the predicted vehicle path will collide with the detected moving object. One or more collisions may be determined at one or more locations and/or at one or more future times. In some implementations, operation 1409 may be performed by a processor component the same as or similar to collision detection component 1225 (shown in FIG. 12 and described herein).

At operation 1410, responsive to determining a collision, a velocity of the unmanned aerial vehicle may be changed to avoid the collision. Changing the velocity of the unmanned aerial vehicle may include changing one or more speeds of the unmanned aerial vehicle. Changing the velocity of the unmanned aerial vehicle may include changing one or more directions of the unmanned aerial vehicle to move the unmanned aerial vehicle in a path that deviates from the predicted vehicle path. In some implementations, operation 1410 may be performed by a processor component the same as or similar to collision avoidance component 1226 (shown in FIG. 12 and described herein).

FIG. 20 illustrates method 2000 for ego-motion estimation. The operations of method 2000 presented below are intended to be illustrative. In some implementations, method 2000 may be accomplished with one or more additional operations not described, and/or without one or more of the operations discussed. In some implementations, two or more of the operations may occur substantially simultaneously.

Referring to FIG. 20 and method 2000, at operation 2001, first visual output signals and second visual output signals may be generated. The first visual output signals may be generated by a first image sensor of a stereo image sensor. The second visual output signals may be generated by a second image sensor of the stereo image sensor. The first visual output signals may convey visual information within a field of view of the first image sensor. The second visual output signals may convey visual information within a field of view of the second image sensor. In some implementations, operation 2001 may be performed by one or more sensors the same as or similar to stereo image sensor 1512, first image sensor 1531, and/or second image sensor 1532 (shown in FIG. 15 and described herein).

At operation 2002, motion and orientation output signals may be generated. The motion and orientation output signals may convey motion information and orientation information of the stereo image sensor. In some implementations, operation 2002 may be performed by a sensor the same as or similar to motion and orientation sensor 1513 (shown in FIG. 15 and described herein).

At operation 2003, a first image and a second image may be obtained, undistorted and rectified. The first image may be determined based on the first visual output signals such that the first visual output signals used to determine the first image are generated by the first image sensor at a first time. The second image may be determined based on the second visual output signals such that the second visual output signals used to determine the second image are generated by the second image sensor at the first time. In some implementations, operation 2003 may be performed by a processor component the same as or similar to imaging component 1520 (shown in FIG. 15 and described herein).

At operation 2004, a depth image may be determined. The depth image may be determined based on a comparison of the first image and the second image. The depth image may include pixel values characterizing distances between the stereo image sensor and objects within the first image and the second image. In some implementations, operation 2004 may be performed by a processor component the same as or similar to depth image component 1521 (shown in FIG. 15 and described herein).
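
By way of non-limiting illustration, a depth image could be computed from a rectified stereo pair with standard block matching, as in the OpenCV sketch below; the focal length and baseline are hypothetical calibration constants, and 8-bit grayscale rectified inputs are assumed:

    import cv2
    import numpy as np

    FOCAL_PX = 700.0     # focal length in pixels (hypothetical)
    BASELINE_M = 0.12    # distance between the two image sensors (hypothetical)

    def depth_from_stereo(left_gray, right_gray):
        """Block-matching disparity on rectified 8-bit grayscale images,
        converted to metric depth via Z = f * B / d."""
        matcher = cv2.StereoBM_create(numDisparities=64, blockSize=15)
        # StereoBM returns fixed-point disparities scaled by 16.
        disparity = matcher.compute(left_gray, right_gray).astype(np.float32) / 16.0
        depth = np.full(disparity.shape, np.inf, dtype=np.float32)
        valid = disparity > 0
        depth[valid] = FOCAL_PX * BASELINE_M / disparity[valid]
        return depth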

At operation 2005, a third image and a fourth image may be obtained, undistorted and rectified. The third image may be determined based on the first visual output signals such that the first visual output signals used to determine the third image are generated by the first image sensor at a second time that is subsequent to the first time. The fourth image may be determined based on the second visual output signals such that the second visual output signals used to determine the fourth image are generated by the second image sensor at the second time. In some implementations, operation 2005 may be performed by a processor component the same as or similar to imaging component 1520 (shown in FIG. 15 and described herein).

At operation 2006, a predicted motion of a stereo image sensor may be obtained. The predicted motion of the stereo image sensor may be determined based on the motion and orientation output signals. In some implementations, operation 2006 may be performed by a processor component the same as or similar to predicted motion component 1522 (shown in FIG. 15 and described herein).

At operation 2007, a predicted third image and a predicted fourth image may be determined. The predicted third image may be determined by adjusting the first image based on the predicted motion of the stereo image sensor, the depth image, and/or other information. The predicted fourth image may be determined by adjusting the second image based on the predicted motion of the stereo image sensor, the depth image, and/or other information. In some implementations, operation 2007 may be performed by a processor component the same as or similar to predicted imaging component 1523 (shown in FIG. 15 and described herein).
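
By way of non-limiting illustration, a predicted image could be synthesized by back-projecting each pixel with its depth, applying the predicted motion, and re-projecting into the new pose; the sketch below assumes a pinhole intrinsic matrix K and a rotation/translation pair (R, t) describing the predicted sensor motion, all hypothetical:

    import numpy as np

    def predict_image(image, depth, R, t, K):
        """Forward-warp `image` into a predicted camera pose using the
        per-pixel depth image and the predicted motion (R, t)."""
        h, w = depth.shape
        v, u = np.mgrid[0:h, 0:w]
        # Back-project pixels to 3D rays, scale by depth, move, re-project.
        rays = np.linalg.inv(K) @ np.stack([u.ravel(), v.ravel(), np.ones(h * w)])
        pts = R @ (rays * depth.ravel()) + t[:, None]
        good = np.isfinite(depth.ravel()) & (pts[2] > 1e-6)
        proj = K @ pts[:, good]
        pu = np.round(proj[0] / proj[2]).astype(int)
        pv = np.round(proj[1] / proj[2]).astype(int)
        inside = (pu >= 0) & (pu < w) & (pv >= 0) & (pv < h)
        predicted = np.zeros_like(image)
        predicted[pv[inside], pu[inside]] = image[v.ravel()[good][inside],
                                                  u.ravel()[good][inside]]
        return predicted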

At operation 2008, an error distribution of the predicted motion may be determined. The error distribution of the predicted motion may be determined by comparing the third image with the predicted third image and the fourth image with the predicted fourth image. In some implementations, operation 2008 may be performed by a processor component the same as or similar to error component 1524 (shown in FIG. 15 and described herein).
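
By way of non-limiting illustration, a simple stand-in for such an error distribution is the set of per-pixel intensity residuals between the observed and predicted images, summarized below by their mean and spread; treating zero-valued predicted pixels as unfilled is an assumption of this sketch:

    import numpy as np

    def prediction_error(third, predicted_third, fourth, predicted_fourth):
        """Collect intensity residuals over pixels the warps actually filled,
        comparing each observed image with its prediction."""
        residual_sets = []
        for obs, pred in ((third, predicted_third), (fourth, predicted_fourth)):
            mask = pred > 0   # assumes zero marks pixels the warp left empty
            residual_sets.append(obs[mask].astype(np.float32)
                                 - pred[mask].astype(np.float32))
        residuals = np.concatenate(residual_sets)
        return residuals.mean(), residuals.std()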

At operation 2009, an estimated actual motion of the stereo image sensor may be obtained. The estimated actual motion of the stereo image sensor may be determined by adjusting the predicted motion of the stereo image sensor to reduce the error distribution of the predicted motion of the stereo image sensor. In some implementations, operation 2009 may be performed by a processor component the same as or similar to actual motion component 1525 (shown in FIG. 15 and described herein).
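
By way of non-limiting illustration, the adjustment could be delegated to a general-purpose optimizer that minimizes the prediction error over a six-parameter motion (translation plus rotation); `error_of_motion` below is a hypothetical callback that re-renders the predicted images for a candidate motion and returns a scalar error such as the residual spread from the step above:

    import numpy as np
    from scipy.optimize import minimize

    def estimate_actual_motion(predicted_motion, error_of_motion):
        """Nudge the 6-DoF predicted motion until the photometric error
        between observed and predicted images stops shrinking."""
        result = minimize(error_of_motion, np.asarray(predicted_motion),
                          method="Nelder-Mead")
        return result.x   # estimated actual motion of the stereo image sensor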

FIG. 25 illustrates method 2500 for refining a depth image. The operations of method 2500 presented below are intended to be illustrative. In some implementations, method 2500 may be accomplished with one or more additional operations not described, and/or without one or more of the operations discussed. In some implementations, two or more of the operations may occur substantially simultaneously.

Referring to FIG. 25 and method 2500, at operation 2501, first visual output signals and second visual output signals may be generated. The first visual output signals may be generated by a first image sensor of a stereo image sensor. The second visual output signals may be generated by a second image sensor of the stereo image sensor. The first visual output signals may convey visual information within a field of view of the first image sensor. The second visual output signals may convey visual information within a field of view of the second image sensor. In some implementations, operation 2501 may be performed by one or more sensors the same as or similar to stereo image sensor 2112, first image sensor 2131, and/or second image sensor 2132 (shown in FIG. 21 and described herein).

At operation 2502, a first image and a second image may be obtained, undistorted and rectified. The first image may be determined based on the first visual output signals such that the first visual output signals used to determine the first image are generated by the first image sensor at a first time. The second image may be determined based on the second visual output signals such that the second visual output signals used to determine the second image are generated by the second image sensor at the first time. In some implementations, operation 2502 may be performed by a processor component the same as or similar to imaging component 2120 (shown in FIG. 21 and described herein).

At operation 2503, a first depth image may be determined. The first depth image may be determined based on a comparison of the first image and the second image. The first depth image may include pixel values characterizing distances between the stereo image sensor and objects within the first image and the second image. In some implementations, operation 2503 may be performed by a processor component the same as or similar to depth image component 2121 (shown in FIG. 21 and described herein).

At operation 2504, a predicted second image may be determined. The predicted second image may be determined by adjusting the first image based on the first depth image, and/or other information. In some implementations, operation 2504 may be performed by a processor component the same as or similar to predicted imaging component 2123 (shown in FIG. 21 and described herein).
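
By way of non-limiting illustration, under rectified stereo geometry the predicted second image can be synthesized by shifting each pixel of the first image by its disparity d = f·B/Z, as sketched below; the focal length and baseline values are hypothetical calibration constants:

    import numpy as np

    def predict_second_image(first_image, first_depth,
                             focal_px=700.0, baseline_m=0.12):
        """Shift each pixel of the first (left) image by its stereo disparity
        to synthesize the view of the second (right) image sensor."""
        h, w = first_image.shape
        predicted = np.zeros_like(first_image)
        u = np.arange(w)
        for v in range(h):
            z = first_depth[v]
            valid = np.isfinite(z) & (z > 0)
            disparity = np.zeros(w)
            disparity[valid] = focal_px * baseline_m / z[valid]
            target = (u - disparity).round().astype(int)   # u_r = u_l - d
            ok = valid & (target >= 0) & (target < w)
            predicted[v, target[ok]] = first_image[v, u[ok]]
        return predicted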

At operation 2505, the first depth image may be refined based on a comparison of the second image with the predicted second image. Refining the first depth image based on the comparison of the second image with the predicted second image may include determining a mismatch between an image patch in the second image and a corresponding image patch in the predicted second image. Responsive to the mismatch meeting and/or exceeding an error threshold, one or more pixel values of the first depth image corresponding to the image patch may be rejected. In some implementations, operation 2505 may be performed by a processor component the same as or similar to refine component 2124 (shown in FIG. 21 and described herein).
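
By way of non-limiting illustration, such patch-level rejection could be implemented as sketched below, where depth values behind patches whose mean absolute mismatch meets or exceeds a threshold are discarded; the patch size and threshold values are hypothetical:

    import numpy as np

    def reject_bad_depth(depth, observed, predicted, patch=8,
                         error_threshold=12.0):
        """Compare corresponding patches of the observed and predicted
        images; where the mismatch meets or exceeds the threshold, reject
        the depth values corresponding to that patch."""
        refined = depth.astype(np.float32)
        h, w = observed.shape
        for top in range(0, h - patch + 1, patch):
            for left in range(0, w - patch + 1, patch):
                a = observed[top:top + patch, left:left + patch].astype(np.float32)
                b = predicted[top:top + patch, left:left + patch].astype(np.float32)
                if np.abs(a - b).mean() >= error_threshold:
                    refined[top:top + patch, left:left + patch] = np.nan
        return refined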

At operation 2506, a third image may be obtained, undistorted and rectified. The third image may be determined based on the first visual output signals such that the first visual output signals used to determine the third image are generated by the first image sensor at a second time that is subsequent to the first time. In some implementations, operation 2506 may be performed by a processor component the same as or similar to imaging component 2120 (shown in FIG. 21 and described herein).

At operation 2507, a predicted motion of a stereo image sensor may be determined. In some implementations, the predicted motion of the stereo image sensor may be determined based on motion and orientation output signals. The motion and orientation output signals may convey motion information and orientation information of the stereo image sensor. In some implementations, the predicted motion may be determined based on visual odometry. In some implementations, operation 2507 may be performed by a processor component the same as or similar to predicted motion component 2122 (shown in FIG. 21 and described herein).

At operation 2508, a predicted third image may be determined. The predicted third image may be determined by adjusting the first image based on the predicted motion, the first depth image, and/or other information. In some implementations, operation 2508 may be performed by a processor component the same as or similar to predicted imaging component 2123 (shown in FIG. 21 and described herein).

At operation 2509, the first depth image may be refined based on a comparison of the third image with the predicted third image. Refining the first depth image based on the comparison of the third image with the predicted third image may include determining a mismatch between an image patch in the third image and a corresponding image patch in the predicted third image. Responsive to the mismatch meeting and/or exceeding an error threshold, one or more pixel values of the first depth image corresponding to the image patch may be rejected. In some implementations, operation 2509 may be performed by a processor component the same as or similar to refine component 2124 (shown in FIG. 21 and described herein).

In some implementations, one or more of methods 700, 1100, 1400, 2000, 2500 may be implemented in one or more processing devices (e.g., a digital processor, an analog processor, a digital circuit designed to process information, a central processing unit, a graphics processing unit, a microcontroller, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information). The one or more processing devices may include one or more devices executing some or all of the operations of methods 700, 1100, 1400, 2000, 2500 in response to instructions stored electronically on one or more electronic storage mediums. The one or more processing devices may include one or more devices configured through hardware, firmware, and/or software to be specifically designed for execution of one or more of the operations of methods 700, 1100, 1400, 2000, 2500.

Although the system(s) and/or method(s) of this disclosure have been described in detail for the purpose of illustration based on what is currently considered to be the most practical and preferred implementations, it is to be understood that such detail is solely for that purpose and that the disclosure is not limited to the disclosed implementations, but, on the contrary, is intended to cover modifications and equivalent arrangements that are within the spirit and scope of the appended claims. For example, it is to be understood that the present disclosure contemplates that, to the extent possible, one or more features of any implementation can be combined with one or more features of any other implementation.

What is claimed is:
 1. An aerial vehicle, comprising: a sensor generating information from an environment around the aerial vehicle; and a processing apparatus coupled to a memory storing instructions that when executed cause the processing apparatus to: generate a depth map based on the environment, the depth map including map cells corresponding to longitude angles and to latitude angles, wherein the depth map represents distances to closest surfaces of the environment around the aerial vehicle, and wherein the depth map is a function of longitude angles and latitude angles; and provide controls for the aerial vehicle based on the depth map; wherein to generate the depth map comprises to: determine distance values for the map cells based on the distances to the closest surfaces of the environment around the aerial vehicle, wherein a first distance value for a first map cell is determined based on a first distance to a first closest surface of the environment around the aerial vehicle at a first longitude angle and a first latitude angle.
 2. The aerial vehicle of claim 1, wherein the depth map includes a dead zone in a polar region, and the distance values are not determined for the map cells in the dead zone.
 3. The aerial vehicle of claim 1, wherein the determination of the distance values disregards distances to surfaces of the environment around the aerial vehicle at individual longitude angles and individual latitude angles that are greater than the distances to the closest surfaces of the environment around the aerial vehicle at the individual longitude angles and the individual latitude angles.
 4. The aerial vehicle of claim 3, wherein the distance values correspond only to the closest surfaces of the environment around the aerial vehicle at the individual longitude angles and the individual latitude angles.
 5. The aerial vehicle of claim 1, wherein the sensor is a stereo image sensor comprising: a first image sensor configured to generate first visual output signals providing first visual information within a first field of view of the first image sensor; and a second image sensor configured to generate second visual output signals providing second visual information within a second field of view of the second image sensor, wherein the depth map is generated by comparing the first visual information with the second visual information.
 6. The aerial vehicle of claim 1, wherein the processing apparatus comprises a hardware-implemented processor and a software-implemented processor.
 7. The aerial vehicle of claim 6, wherein the hardware-implemented processor is located remotely from the software-implemented processor.
 8. The aerial vehicle of claim 1, further including instructions that when executed cause the processing apparatus to: responsive to detecting the aerial vehicle traveling a threshold distance, transform the depth map such that the center of the depth map coincides with a location of the aerial vehicle.
 9. A method, comprising: generating a depth map based on an environment of an unmanned aerial vehicle, the depth map including map cells corresponding to longitude angles and to latitude angles, wherein the depth map represents distances to closest surfaces of the environment around the aerial vehicle, and wherein the depth map is a function of longitude angles and latitude angles; and providing controls for the unmanned aerial vehicle based on the depth map; wherein generating the depth map comprises: determining distance values for the map cells based on the distances to the closest surfaces of the environment around the aerial vehicle, wherein a first distance value for a first map cell is determined based on a first distance to a first closest surface of the environment around the aerial vehicle at a first longitude angle and a first latitude angle.
 10. The method of claim 9, wherein the depth map includes a dead zone in a polar region, and the distance values are not determined for the map cells in the dead zone.
 11. The method of claim 9, wherein the determination of the distance values disregards distances to surfaces of the environment around the aerial vehicle at individual longitude angles and individual latitude angles that are greater than the distances to the closest surfaces of the environment around the aerial vehicle at the individual longitude angles and the individual latitude angles.
 12. The method of claim 9, comprising: generating first visual output signals conveying first visual information within a first field of view of a first image sensor of the unmanned aerial vehicle; and generating second visual output signals conveying second visual information within a second field of view of a second image sensor of the unmanned aerial vehicle, wherein the depth map is generated by comparing the first visual information with the second visual information.
 13. The method of claim 9, comprising: responsive to detecting the unmanned aerial vehicle traveling a threshold distance, transforming the depth map such that the center of the depth map coincides with a location of the unmanned aerial vehicle.
 14. A non-transitory computer-readable medium including instructions that, when executed by one or more processors, cause the one or more processors to: generate visual information from an environment around an unmanned aerial vehicle; generate a depth map based on the environment, the depth map including map cells corresponding to longitude angles and to latitude angles, wherein the depth map represents distances to closest surfaces of the environment around the aerial vehicle, and wherein the depth map is a function of longitude angles and latitude angles; provide controls for the unmanned aerial vehicle based on the depth map; and determine distance values for the map cells based on the distances to the closest surfaces of the environment around the aerial vehicle, wherein a first distance value for a first map cell is determined based on a first distance to a first closest surface of the environment around the aerial vehicle at a first longitude angle and a first latitude angle.
 15. The non-transitory computer-readable medium of claim 14, wherein the depth map includes a dead zone in a polar region, and the distance values are not determined for the map cells in the dead zone.
 16. The non-transitory computer-readable medium of claim 14, wherein the determination of the distance values disregards distances to surfaces of the environment around the aerial vehicle at individual longitude angles and individual latitude angles that are greater than the distances to the closest surfaces of the environment around the aerial vehicle at the individual longitude angles and the individual latitude angles.
 17. The non-transitory computer-readable medium of claim 16, wherein the distance values correspond only to the closest surfaces of the environment around the aerial vehicle at the individual longitude angles and the individual latitude angles.
 18. The non-transitory computer-readable medium of claim 14, wherein the unmanned aerial vehicle includes a sensor that is a stereo image sensor comprising: a first image sensor configured to generate first visual output signals providing first visual information within a first field of view of the first image sensor; and a second image sensor configured to generate second visual output signals providing second visual information within a second field of view of the second image sensor, wherein the depth map is generated by comparing the first visual information with the second visual information.
 19. The non-transitory computer-readable medium of claim 14, wherein the one or more processors comprise a hardware-implemented processor and a software-implemented processor; and wherein the hardware-implemented processor is located remotely from the software-implemented processor.
 20. The non-transitory computer-readable medium of claim 14, further including instructions that when executed cause the one or more processors to: responsive to detecting the unmanned aerial vehicle traveling a threshold distance, transform the depth map such that the center of the depth map coincides with a location of the unmanned aerial vehicle.